


Institutional Archive of the Naval Postgraduate School 


Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations 1. Thesis and Dissertation Collection, all items 


1984-09 


An evaluation of discretized conditional 

probability and linear regression threshold 
techniques in model output statistics 

forecasting of visibility over the North Atlantic Ocean 


Diunizio, Mark 


Monterey, California. Naval Postgraduate School 
http://ndl.handle.net/10945/19309 


This publication is a work of the U.S. Government as defined in Title 17, United 
States Code, Section 101. Copyright protection is not available for this work in the 
United States. 


Downloaded from NPS Archive: Calhoun 


| Calhoun is the Naval Postgraduate School's public access digital repository for 
uh D U DLEY research mate _— and institutional publications c reated by — NPS community. 
«ili | } 7 Calhoun is named for Professor of Mathematics Guy K. Calhoun, NPS's first 
a | KN OX appointed — and published -- scholarly author. 
: ] LIBRARY Dudley Knox Library / Naval Postgraduate School 
411 Dyer Road / 1 University Circle 
Monterey, California USA 93943 





http://www.nps.edu/library 


DUDLEY KNOX LIBRA RARY 
NAVAL POST STGRADUATE SCHOOL 
MONTEREY, CALIFORNIA 93943 








. "Ss 
Pa eine’ 








NAVAL POSTGRADUATE SCHOOL 


Monterey, Galifornia 





UD Peet 


Piet VvASUALTION OF DISCRETIZED CONDITIONAL 

PROBABILITY AND LINEAR REGRESSION THRES- 

DOB eth eCalloukor UN MODEL OULPUP SITAZISTICS 

Boreas liINGuor. VISlBiLity -evER THE NORTH 
ATLANTIC OCEAN 


by 
Mark Piunizio 
September 1984 


Thesis Advisor: Robert J. Renard 





Approved for public release; distribution unlimited 


T222051 





Os A 


SECURITY CLASSIFICATION OF THIS PAGE (When Data Entered) 


READ INSTRUCTIONS 
, 


4. TITLE (and Subtitte) : a 5. TYPE OF REPORT & PERIOD COVERED 
An Evaluation Ou Discretized oman ee Aeneas i > 
bility and Linear Regression Thresho oS September 1984 


in Model Output Statistics Forecasting of Visi- 


bility over the North Atlantic Ocean 


7. AUTHOR(S) 8. CONTRACT OR GRANT NUMBER(S) 










1. 




















Merck DiuniZlo 


9. PERFORMING ORGANIZATION NAME AND ADORESS » PROGRAM ELEMENT, PROJECT, TASK 
AREA & WORK UNIT NUMBERS 


Naval Postgraduate School 
Monterey, California 93943 





11. CONTROLLING OFFICE NAME AND ADDRESS 12. REPORT DATE 
September 1984 
Naval Postgraduate School 13. NUMBER OF PAGES 
Monterey, California 93943 733 


[14. MONITORING AGENCY NAME & AODRESS(If different from Controffing Office) | 1S. SECURITY CLASS. (of thie report) | 


4 


Unclassified | 


DECLASSIFICATION/ DOWNGRADING 
SCHEDULE 











16. DISTRIBUTION STATEMENT (of this Report) 


Approved for public release; distribution unlimited 


17. DISTRIBUTION STATEMENT (ol the abstract entered In Block 20, If different from Report) 


18. SUPPLEMENTARY NOTES 


19. KEY WORDS (Continue on reverse aide if necessary and Identify by block number) 


Model Output Statistics, Visibility, North Atlantic Ocean 
Pasibility, Marine Horizontal Visibility, Discretization, 
Conditional Probability, Physically Homogeneous Ocean Areas, 
Minimum Probable Error Threshold Models, Weather Forecasting, 


20. ABSTRACT (Continue on reverse side if necesaary and identify by block number) 


This report describes the application and evaluation of four 
primary statistical models in the forecasting of horizontal 
marine visibility over selected physically homogeneous areas of 
the North Atlantic Ocean. The main focus of this study is to 
propose an optimal model output statistics (MOS) approach to 
operationally forecast visibility at the 00-hour model initiali- 
zation time and the 24-hour and 48-hour model forecast 


FORM ) 
DD , jan 73 1473 = EDITION OF 1 Nov 6515 OBSOLETE 


SN 0102- LF: 014-6601 ai Pee D 


SECURITY CLASSIFICATION OF THIS PAGE (When Deta Entered) 


SO UNC Ea Sour LED 


SECURITY CLASSIFICATION OF THIS PAGE (When Date Entered) 


#19 =- KEY WORDS - (CONTINUED) 


Maximum-Likelihood-of-Detection Threshold Models, 
Linear Regression, Natural Regression, 
Maximum Probability 


#20 = ABSTRACT — (CONT TIeEE 


projections. The technique utilized involves the 
manipulation of observed visibility and Navy Operational 
Global Atmospheric Prediction System (NOGAPS) model output 
parameters. The models employ the statistical methodolo- 
gies of maximum conditional probability, natural regressio 
and minimum probable error linear regression threshold 
techniques. Additionally, an evaluation of the 1983 
predictive arrays/equations using 1984 NOGAPS data fields 
and a maximum-likelihood-of-detection threshold model were 
accomplished. Results show that two statistical approaches, 
namely a maximum conditional probability strategy were 
lizing linear regression equation predictors and the 
minimum probable error threshold models, produce the best 
results achieved in this study. 


> N 0102- LF- 014-6601 UNCLASS IF LEE 


a  ———— 
2 secuRITY CLASSIFICATION OF THIS PAGE(When Data Enterec) 


2 a ee - 


Approved for public release; distribution unlimited 
An Evaluation of Discretized Conditional Probability 
and Linear Regression Threshold Techniques in Model 


Output Statistics Forecasting of Visibility over the 
North Atlantic Ocean 


by 
Mark Diunizio 


Lieutenant, United States Navy 
B. S., United States Naval Academy, 1977 


Submitted im partral” fulfillment of the 
requirements for the degree of 


MASTER OF SCIENCE IN METEOROLOGY AND OCEANOGRAPHY 


from the 


NAVAL POSTGRADUATE SCHOOL 
September 1984 


ey vse LIERARY 
DU asTeN SUATE SCHOOL 
MONTEREY CALIFORNIA 93243 


ee 


ABSTRACT 


This report describes the application and evaluation of 
four primary statistical models in the forecasting of hori- 
zontal marine visibility over selected physically homogeneous 
areas of the North Atlantic Ocean. The main focus of this 
study is to propose an optimal model output statistics (MOS) 
approach to operationally forecast visibility at the 0U0=neG. 
model initialization time and the 24-hour and 48-hour model 
forecast projections. The technique utilized involves the 
manipulation of observed visibility and Navy Operational 
Global Atmospheric Prediction System (NOGAPS) model output 
parameters. The models employ the statistical methodologies 
of maximum conditional probability, natural regression and 
minimum probable error linear regression threshold tech- 
niques. Additionally, an evaluation of the 1983 predictive 
arrays/equations using 1984 NOGAPS data fields and a maximum- 
likelihood-of-detection threshold model were accomplished. 
Results show that two statistical approaches, namely a maxi- 
mum conditional probability strategy utrlizing Vineas 
regression equation predictors and the minimum probable 
error threshold models, produce the best results achieved in 


this study. 
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I. INTRODUCTION AND BACKGROUND 


One of the most significant advances in objective weather 

MaecictlOon since the introduction of numerical weather 
prediction in the 1950's and satellite remote sensing capa- 
bilities in the 1960's, has been the development of Model 
SGbeue Statistics (MOS) weather forecasting method by Glahn 
miembowny (1972). Tm general, this technique is the deter- 
mination of a statistical relationship between an operational 
weather element (predictand), which may or may not be fore- 
Soe by numerical methods, and numerical model output varia- 
bles (predictors), usually via linear regression methods. 
The resulting predictand/predictor regression equations 
provide the basis for generating a statistical weather pre- 
@iction. The National Weather Service (NWS) has included MOS 
as an integral part of their weather forecasting operations 
eeence the early 1970's. Currently, the NWS maintains MOS 
prediction equations for approximately 15 weather elements 
_reecclling » Visibility, obstructions to vision, precipi- 
Buelom, CLhc-) at forecast times ranging from 6 to 48 hours. 
M@e-se Lorecasts are routinely provided to approximately 295 
Sevilian and 190 military locations throughout the continental 
United States (CONUS) and Alaska [Glahn, 1983]. 

Based on the impressive results achieved with the NWS 


MOS program, the Department of Defense (DOD), through the Air 


io 


Weather Service (AWS), implemented and operated a quasi- 
global version of the NWS MOS system at the Air Force Global 
Weather Center (AFGWC), Offutt AFB, Nebraska [Best and Pryor, 
1983}. The first operational forecasts obtained from the AWS 
MOS system were produced by AFGWC in December 1980 and the 
system ran operationally for a period of approximately 18 
months. Regions for which operational MOS forecasts were 
produced included Europe, Asis (including Korea and Japan), 
the South China Sea (including the Philippines and Taiwan), 
the near and middle east and northern Africa. The AWS MOS 
program was terminated with the recent decision to replace the 
current hemispheric primitive equation (PE) model with a 
spectral global dynamic model [Klein, 1981]. 

Throughout its tenure as an operational forecast scheme, 
the AWS MOS system provided the U.S. Air Force with a rela- 
tively low cost, flexible and responsive prediction network. 
Further development of the AWS MOS system has been postponed 
until sufficient spectral model output is archived. 

The U.S. Navy, by virtue of its unique Marine forecacime 
ing responsibilities, has a keen interest in applying MOS 
forecasting schemes to global oceanic regions. Through the 
research and development efforts of the Naval Environmental 
Prediction Research Facility (NEPRF) in Monterey, California, 
the Navy has sponsored a limited amount of research into naval 
applications of MOS. In particular, statistical studies Have 


been done into forecasting Levante winds in Spain [Godfrey and 


14 


Mewe, 19/9), Ce1ling and visibility prediction in the southern 
California (SOCAL) naval operating area [Lewit, 1980], marine 
aeGmancd Vistollity predictability in the North Pacific Ocean 
[Renard et al., 1983 and Renard and Thompson, 1984]. Presently, 
a program is in operation which provides MOS forecasts for 
selected U.S. Navy and Marine Corps CONUS locations. These 
services, which are made available from NWS, are based on the 
National Meteorological Center (NMC) limited fine mesh model 
predictions. This MOS program was initiated on 10 November 
1982 and provides forecasts for twelve weather parameters 
which include visibility, obstructions to visibility and 

cloud amount [Naval Environmental Prediction Research 
maecrlaty, 1982). 

The results of these limited studies along with the 
encouraging performances of both the NWS and AWS MOS programs 
and the implementation of the Navy Operational Global Atmos- 
pheric Prediction System (NOGAPS) dynamical primitive equation 
(PE) model at the Fleet Numerical Oceanography Center (FNOC), 
in Monterey, California prompted the decision in September 
1982 for the Navy to pursue its own MOS program. 

Fig. 1 is an overview of the currently proposed milestones 
for the Navy MOS program. The first operational weather 
parameter being investigated in this proposed ten-year Navy 
Seeere 15 horizontal visibility at sea, with the initial goal 
of this project being the investigation and development of 
Statistical predictive schemes for forecasting horizontal 


Visibility over the North Atlantic Ocean. 


iD 


The impact of fog and other impediments to visibaycyeen 
naval operations is well documented throughout maritime 
history. Records show countless catastrophes and accidents 
which were directly attributable to poor visibility at sea. 
For example, on 29 May 1914, the Canadian liner Empress of 
Ireland collided with the Norwegian vessel Storstad in dense 
fog on the Saint Lawrence River resulting in 1,024 fatalities 
and similarly, the legendary "North Sea haze" was a critical 
element in the World War I tactics employed at Jutland in 
1916. Also, one of the most spectacular maritime disasters 
in the U.S. Navy's history took place on 9 September 1923 
when seven Pacific fleet destroyers struck the rocks and ran 
aground in dense fog off of Point Arguello, California. 

Research into predicting marine visibility via traditional 
linear regression methodologies has taken place at the Naval 
Postgraduate School (NPS) since the early 1960's. Generally, 
early visibility forecasting experiments identified potential 
physical air/ocean mechanisms [Schramn, 1966] and emphasized 
the inherent likelihood of human error in at-sea visibility 
observations [Nelson, 1972]. Later exper imeneneten by 
Aldinger (1979), Yavorsky (1980) and Selsor (1980) (concer 
trated on various modifications to multiple linear regression 
schemes and the analysis of prediction skill measurements. 

This study presents a direct follow-on to the research 
presented by Karl (1984), in which statistical methodologies 


presented by Preisendorfer (1983a,b,c) and multiple linear 


16 


regression techniques presented by Lowe (1984a) were compared 
and contrastei. In Karl's preliminary study, Preisendorfer's 
three strategies, two based on maximum conditional proba- 
bility and one based on natural regression, as well as 

Lowe's linear regression threshold models were tested and 
applied to sets of FNOC model output parameters (MOPs) from 
both the North Pacific and North Atlantic Ocean areas. The 
North Atlantic Ocean study was separated into effective 
physically homogeneous areas [Lowe, 1984b]. Karl's study 
specifically dealt with an evaluation of the MOS scheme 
applied to oceanic regions for the TAU-00 model output 

emeing the period 15 May to 07 July 1983. 

This study concerns itself with a continued evaluation 
and further refinement of statistical methods proposed by 
Preisendorfer as well as the linear regression threshold 
models presented by Lowe. With reference to Karl's study, 
Other North Atlantic Ocean areas and model forecast projections 


(e.g. TAU-24 and TAU-48) are addressed. 
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Ii. OBJECTIVES AND APPROACH 


The primary objectives of this study are to continue the 
previous NPS horizontal marine visibility predtetionea-= 
Search initiated by Karl (1984) and to continue the search 
for an optimal Model Output Statistics (MOS) prediction 
scheme to operationally forecast coastal and open ocean 
visibility over the North Atlantic Ocean. The approach 
employed in meeting the stated objectives is listed below: 

A. Apply and evaluate the Preisendorfer maximum proba- 
bility and natural regression strategies (1983a,b,c) to addi- 
tional North Atlantic Ocean homogeneous areas [Lowe, 1983b] 
using May through July 1983, NOGAPS predictand/predictor 
data. 

B. Expand the Model Output Predictor (MOP) data sets to 
include the NOGAPS model TAU-00, and the TAU-24 and TAU-48 
prognostic times defined in Chapter III. 

C. Investigate specific two-stage, equal variance and 
quadratic multiple linear regression threshold models pro= 
posed by Lowe (1984a) for the oceanic areas and model output 
periods addressed in A. and B. above. 

D. Compare and contrast the individual resulesterteeve 
Preisendorfer statistical methodologies to those of the Lowe 
appreae. 

E. Conduct a limited series of experiments in which a 


1984 data set, 15 May to 23 June, is utilized as an 
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independence evaluation of the predictive models constructed 


with 1983 NOGAPS data. 


F. Based on A. to E. above, present an interim recommenda- 
tion for an optimal statistical approach to forecast North 
meeanete Ocean horrzontal visibility as a function of pre- 


diction time and homogeneous area. 


i? 


Tit. DATA 





A. VISIBILITY OBSERVATIONS AND SYNOPTTe eee r. 

Horizontal visibility observations taken from seagoing 
platforms are reported as values of ten standardized World 
Meteorological Organization (WMO) synoptic weather codes. 
These codes range in value from 90, which corresponds to 
visibility less than 50 meters, to 99, which corresponds to 
visibility equal to or greater than 50 kilometers. Human 
observational error and inexactness in measuring visibility 
at sea necessitates a generalization of visibility classifi- 


cation for prediction purposes, as follows: 


Visipility Gabeqory Synoptic Code Visibility Range 
i 90-94 < 250 
EE 25 iG > 2 km to < EORSa 


HESIOIE oF 9 10 km 


|v 


The above scheme coincides with the classification scheme 
proposed Be Karl (1984) and is based upon the below listed 
U.S. Navy operational criteria. 

1. 10 km (5 n mi)--U.S. Navy aircraft cCamprvenpeaeea- = 
flight recovery operations change from visual (VFR) to 
controlled (IFR) approach guidelines [Department of the 
Navy, 1979]. 

2. 2 km (1 n mi)--the sounding of reduced) visipreee, 


Signals for all vessels operating in international waters. 
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ine term “reduced visibility” is not specifically defined in 
the International Regulations for Preventing Collisions at 
Sea, 1972. The distance of 1 nmi is generally considered 


to be the governing operational distance. 


B. NORTH ATLANTIC OCEAN DATA 

1. Area 

The North Atlantic Ocean, from 0°-80° N latitude, 

was divided into homogeneous oceanic areas by Lowe (1984b) 
uSing a statistical cluster analysis technique. The specific 
homogeneous areas evaluated in this study are identified as 
areas 2, 3W and 4 on Fig. 2. These areas were selected be- 
Cause they individually represent a range of different rela- 
tive frequencies of poor visibility observations. Area 3W, 
which was used by Karl (1984) for his preliminary experimen- 
tation, represents an area of relatively frequent occurrence 
of poor visibility, while area 4 represents an area of rela- 
tively sparse occurrence of poor visibility and area 2 
represents an intermediate case. 

2. Time Period 

Data from mid-May 1983 to mid-July 1983 were combined 

to form a more extensive data set, hereafter referred to as 
FATJUNE 1983. FATJUNE 1983 was selected as the initial data 
set for statistical experimentation because of the high fre- 
menecy Of occurrence of poor visibility observations for 


meamy areas of the North Atlantic Ocean during this period. 
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1200 GMT synoptic ship report data were used exclusively a 
this study. This time corresponds to general daylight condi- 
tions over the North Atlantic Ocean during FATJUNE. In 
addition to FATJUNE 1983, a limited May 15 to gune 23s 
data set, possessing the same geographical coverage and day- 
light characteristics of FATJUNE 1983, was utilized in an 
independent test of the predictive arrays and equations 
generated in this study. 

For the purpose of this study, TAU-00 generally 
represents six-hour model forecast fields valid at 1200 GMT. 
Three specific fields, namely temperature, geopotential 
height and wind, are model initialization fields valid at 
1200 GMT. TAU-24 and TAU-48 are defined as 24-hour and 
48-hour model forecast fields, valid at 1200 GMT. MTAU-OO, 
TAU-24 and TAU-48 model output parameters (predictors) are 
employed in the 00, 24 and 48 hour forecast schemes, respect- 
fully. Summaries of the visibility frequencies for each 
visibility category, as a function of homogeneous area 
and prediction time, for FATJUNE 1983 and the 15 May to Ze 
June 1984 data set, are contained in Tables I through III 
and Table VI respectively. 

3.) SYEODULIG Weather ehcpe mes 

All synoptic visibility observations (predictand data) 
for this study were provided by the Naval Oceanography Com- 
mand Detachment (NOCD), Asheville, North Carolina which is 


co-located with the National Climatic Data Center (NCDC). 
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The observations which contained systematic observer error 
or were obviously erroneous, as determined from the data 
m@alaty indicators provided with the data, were deleted 
from the working data sets. 

4; Predictor Parameters 

Pifty TAU-00, fifty-four TAU-24 and fifty-four TAU-48 

moage]l Output predictors (MOP's) were provided by the Fleet 
Numerical Oceanography Center (FNOC), Monterey, California. 
These parameters are generated by their current operational 
atmospheric prediction model, the Navy Operational Global 
Atmospheric Prediction System (NOGAPS). All MOP's were 
interpolated from model grid coordinates to synoptic ship 
report position using a linear interpolation scheme. [In 
emeton tO the initial group of model output parameters, 
ten derived parameters representing calculated quantities, 
Such as parameter gradients and products, were included as 
Porenatial predictors. Of the entire group of potential 
meeaictOr parameters, only forty TAU-00 and forty-seven 
TAU-24 and TAU-48 MOP's were actually used to develop the 
various Preisendorfer (1983a,b,c) and linear regression 
threshold models [Lowe, 1983a]. The remainder of the NOGAPS 
model output parameters were dropped from consideration because 
1) the MOP lacked a physical linkage to the visibility pre- 
Seeemand and/or 2) a lack of significant digits (lost during 
miemeransfer of FNOC data to the main computer center's 


mass storage system) rendered the particular MOP useless. 


25 


A list of all available TAU-00, TAU-24 and TAU-48 MOP's are 
included in Appendix B. 

For each homogeneous area and model forecast projec- 
tion, a set of three linear regression equations, in addition 
to the aforementioned MOP's, were included as potential 
MOP's for a separate evaluation of the Preisendorfer methodology 
(the PR+tBMD model). These three predictor equations were 
obtained from two standardized linear regression software 
packages, namely P2R--stepwise regression and P9R--all 
possible subsets regression, as addressed in the BMDP Sta- 
tistical Software [University of Calafornia, (260) ae 
P2R was initially employed in the evaluation of areas 2 and 
4, TAU-00 data, while the P9R program was employed in the 
remainder of the cases studied. The change to the P9R 
program was initiated as a safeguard against any potential 
predictor selection bias incorporated in the P2R software. 
Specific details concerning these statistical software 


packages are addressed in Appendix A. 


C. DEPENDENT/INDEPENDENT DATA SETS 

Due to the limited amount of data available to this 
study for each of the North Atlantic Ocean homogeneous areas, 
it was necessary to withhold a significant amount of the 
observations from the developmental model to use as an 
independent data set. That amount was set as onerthird for 


the experiments reported here. This was accomplished by the 
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use of a counter and transfer statement in the computer 
mA@agrams wiich prevented every third observation from enter- 
ing the developmental computations. To ensure that the 
dependent and independent data were representative of the 
Same population, a 95% confidence interval for proportions 
[Miller and Freund, 1977] was established from the entire 
data set, for each visibility category, and the dependent 

and independent data sets were constrained to have visibility 
frequencies within these established confidence intervals. 
Table IV summarizes the dependent and independent data for 


the North Atlantic Ocean data set. 


Z> 
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A. TERMS AND SYMBOLS 
The terms and statistical symbols defined below will 


be used throughout the remainder of this report. The 


formal mathematical definitions are described in Karl (198475 


1. Maximum probability strategy--choosing forecast 
Visibility category based upon the highest conditional 
probability of visibility within a predictor interval. 

a. MAXPROB I--designation of the maximum probability 
strategy in which ties of the highest conditional probabili- 
ties in a predictor interval are resolved by the generation 
of a random number. 

b. MAXPROB II--designation of the maximum probability 
strategy in which ties of the highest conditional probabili-= 
ties in a predictor interval are resolved by assigning the 
lowest visibility category, of those tied, as the forecast 
Ga eeqory. 

2. Natural regression strategy--choosing forecast visi- 
bility categories based upon the statistical average of the 
conditional probabilities of visibility within a predictor 
interval. 

3. AQ--the probability of a zero-class visibilie #624. -. 
gory forecast error (e.g., if visiblity categon, aa. 


forecast and observed). 
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4. Al--the probability of a one-class visibility category 
forecast error (e.g., if visibility category I is forecast 
and category II is observed). 

5. CE--class error parameter defined as AO0+2Al, used as 
the primary aid in identifying the first predictor. 

6. PP--the potential predictability of visibility by any 
given predictor. 

7. Functional dependence. This is a measure of the 
stochastic dependence of one predictor upon another. Func- 
tional dependence is the probability that one of the predic- 
tors will change when the other does. High functional 
dependence values between one already selected predictor and 
another potential predictor, indicates that little addi- 
tional information beyond the selected predictor is possible. 
Conversely, a low functional dependence value between the 
Same two predictors, indicates that each predictor possesses 
a high degree of linearly uncorrelated information concerning 
the predictand. Functional dependence range is 0.0 to 1.0 
(1.0 = highest functional dependence). The specific deriva- 
tion and mathematical description of the concept of "func- 
tional dependence" is discussed in greater depth by 
Preisendorfer (1983c). 

8. Root-sum-sgquared functional dependence. The functional 
dependence of a predictor on all predictors already included 


in the developmental model. It is equal to the square-root 
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of the sum of the squares of the individual functional 
dependence values. 

9. TSl--threat score for visibility Categoria. 
computed from a contingency table (see Appendix C). 

10. ATSl--adjusted threat score for visibility "categenm, 
I which removes the influence of the data set category 
frequency (see Appendix C). 

ll. AAO--adjusted AO. A contingency table statistic 
which removes the influence of the most frequent visibility 
category ina set of data (similar to a normalized value) 


(see Appendix C). 


Be COMPUTER PROGRAMS 

Four computer programs were developed to test the pro- 
posed Preisendorfer (1983a,b,c) methodology. The programs 
are on file in the Department of Meteorology, Naval Post- 
graduate School, Monterey, California, 93943. 

1. A program to compute AO, Al, CE and PP for all 'prediee 
tors, all strategies (MAXPROB I, MAXPROB II and natural 
regression) for a particular number of equally populous 
predictor intervals. Statistics for the three strategies 
are based upon the same predictor(s) rather than the best 
predictor(s) for each strategy. 

2. A program to compute functional dependence values for 
all predictors, on a given predictor, for a given number of 


equally populous predictor intervals and to compute the 
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mesociatea 96% Critical confidence interval value, referred 
to as functional dependence(96) in this study, by Monte Carlo 
means. 

ees OrOogram tO Construct contingency tables and to 
compute skill and threat scores, for both the dependent and 
independent data sets. 

4. <A program to generate 100 random data sets, from the 
Marginal probabilities of the predictor(s) in the develop- 
mental model, and to compute upper and lower 5% critical 
confidence interval values for AO and Al to be used for 
testing the significance of the results from each of the 
Preisendorfer mcdels against chance. These confidence 


mirerval values are calculated via Monte Carlo means. 


fee) MODELS 
ieee ocisendorter PR Model 

This model represents the first of two different 
applications of the basic Preisendorfer methodology 
mereisendorfer, 1983a,b,c]. Karl (1984), in his preliminary 
research, provides a rigorous interpretation and results 
associated with this statistical forecasting methodology. 
Karl's study provides the necessary background for the con- 
tinued investigation and evaluation of this model and readers 
interested in specific details are advised to consult this 
document. 

The PR model utilizes the working set of NOGAPS 


model output parameters (MOP's) and derived parameters 
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(Appendix B) as potential predictors in consteuctingea 
developmental model, based upon the dependent data set, 
which provides the structure by which the independent data 
set is tested and evaluated. In general, these potential 
predictors have their range of values partitioned into 
discretized equally populous predictor intervals (celta 
and conditional probabilities of the predictand are calcu- 
lated according to the three modified visibility categories 
(VISCAT) I, II and III. Three separate strategies of deter- 
mining the specific VISCAT which is to be identified with 
each predictor value, are proposed. These strategies, two 
based upon maximum probability and the third based ona 
natural regression approach, are addressed as MAXPROB I, 
MAXPROB II and natural regression in the remaining portions 
Of this vscudy. 

Initial evaluation of this model involves varying the 
equally populous predictor intervals from sizes of four 
through ten, and selecting an optimal first predictor which 
provides one of the following requirements in the designated 
order: 

a. the lowest CE value of all the potential predictors 
b. the highest PP value of all the potential predictors 

Once a first predictor is identified for each of the 
four through ten equally populous predictor intervals, 
corresponding VISCAT I, II and III threat and AO skill 


scores (Appendix E) are calculated for both the dependent 
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and independent data sets. The practice of selecting an 
optimal equally populous predictor interval from the eligible 
Srouping sizes of four through ten, was proposed by Karl 
(1984) as a practical procedure which would permit the 
realization of peak skill scores as well as maintain asso- 
ciated computer storage requirements at a manageable level. 
An unfortunate consequence of this range of potential group- 
ing sizes is that certain statistical calculations associated 
with equally populous predictor intervals of eight, nine 

and ten are terminated before completion due to a two mega- 
byte storage ceiling at the NPS W.R. Church Computer Center. 
When considering potential predictor intervals, the size of 
the interval is of obvious importance, with lower values 
being the most desirable. The criterion for determining the 
optimal equally populous predictor interval is to select the 
smallest interval value which maximizes the dependent data 
set adjusted AO and independent adjusted VISCAT I threat 
score. For this study, this interval value was fixed for 
all ensuing aspects of the model evaluation. In practice, 
the selection of equally populous predictor intervals was 
based upon the initial adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
MAXPROB II strategy. The MAXPROB II scores were routinely 
found to be the highest for each case evaluated, at this 
early stage in the evaluation process, and therefore used 


as the basis for grouping selection. As the equally populous 
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grouping interval remains constant throughout the Preisen- 
dorfer models, the MAXPROB I and natural regression strate- 
gies practically play no role in the predictor selection 
process. 

Once the first predictor and its associated equally 
populous predictor interval have been identified, a func- 
tional dependence test of the first predictor against those 
remaining potential predictors is run. The second, third 
and all subsequent predictors are selected Only sti epe sau 
the following criteria vane mec: 


a. subsequent predictors must increase AO over the 
AO value attained at the preceding level, and 


b. the selected predictor must have the lowest 
functional dependence and root-sum-square 
functional dependence of all the remaining 
potential predictors. 

After each predictor selection stage has been com- 
pleted, significance tests are run upon the dégelopmen tone 
model to determine if the results are suitably significant 
as compared to random chance. This testing is accompiLdsamie 
Via Monte Carlo testing methods using the conditional 
probabilities of the selected predictors and assuming equal 
probability of occurrence for the three modified visibility 
categories. Functional dependence/root-sum-square functional 
dependence, AQ, and Al statistics are calculated for each of 
100 randomly generated data sets. For the developmental 
model to yield results which are significant at the speci- 


fied confidence interval values, each one of the following 


criteria must be met: 
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a. AO must be equal to or greater than A0(96) 

b. Al must be equal to or less than A1(05) 

c. the functional dependence value for a selected 
predictor must be less than functional 
dependence (96) 

As with the process of selecting equally populous 
Peearceor intervals, the AO, AO({96), Al and Al(05) statistics 
(Appendix G), reflect scores for the MAXPROB II strategy. 

The AO statistics routinely were found to be the highest for 
this strategy and thus were used as the basis for ensuring 
the aforementioned predictor selection criteria were met. 
However, the MAXPROB I strategy often produced AO values 
identical to MAXPROB II. The natural regression strategy 
regularly lagged the two maximum probability strategies in 
AO and Al scores and consequently played no real role in the 
prediction selection process. Specific trends in AO/Al 
scores can be seen in Appendix G. 

From a practical standpoint, the model development 
continues until computer storage limitations preclude further 
@eaieton Of predictors. This generally occurred at the fifth 
predictor level. 

Once the developmental model is completed, contingency 
tables of forecast visibility category versus observed visi- 
bility category are constructed for both the dependent and 
independent data sets, and threat and skill scores are 


computed and compared. 
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2. Preisendorfer PR+BMD Model 

This model is still the PR model described above. 
Now, sets of three linear regression equations (Appendix D) 
are added to the list of potential NOGAPS model output and 
derived predictor parameters. The inherent difference of 
these predictors is evidenced in both the prediccor Scilla 
tion process as well as in the resulting skill and threat 
scores, as will be demonstrated in Chapter V. 

3. Equal Variance Threshold Model (EVAR) 

This model represents the first of two threshold 
models, developed by Lowe (1984a), which were evaluated in 
this study. The model uses an algorithm which requires the 
assumption that the variances of two normally distributed 
populations which are to be separated by a threshold value 
are equal, while their means are unequal. A detailed dis- 
cussion of the theoretical background of this scheme is 
addressed in Appendix A. 

A two-stage separation scheme was used to effectively 
divide the visibility categories (VISCAT) I, II and III 
into a first-stage VISCAT I versus a combined VISCAT TPp ies 
VISCAT III separation, and subsequently VISCAT II versus 
VISCAT III separation for each homogeneous area and model 
output time. This separation was accomplished by setting 
all VISCAT I observations equal to an arbitrary integer value 
of zero and the combined VISCAT II plus VISCAT III observations 


equal to an arbitrary integer value of one and generating 
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a linear regression equation to suitably describe the 
resulting two distributions. This linear regression equation 
was then used in the graphical plotting program BMDP5D, 

from the BMDP Statistical Software [University of California, 
1983], to generate a set of histograms describing the first 
stage separation. Included with the graphical histogram 
output is a listing of the individual frequency of observa- 
tion (P), mean (uu) and standard deviation (co) of each of the 
specified visibility distributions. These statistics are 
incorporated into the equal variance threshold algorithm 

and a corresponding threshold value is calculated. 

Following the first-stage threshold calculation, a 
second linear regression equation is generated, based upon 
eeey those VISCAT II plus VISCAT III observations which 
exceed the previously calculated threshold value. This 
effectively eliminates any VISCAT II plus VISCAT III obser- 
vations less than the threshold value (1.e., those observa- 
tions contained in the tail of the distribution), from being 
included in the second-stage regression. The previous proce- 
dure of generating corresponding histograms and statistics 
is repeated, based upon all VISCAT II observations being 
assigned an arbitrary integer value of zero and all VISCAT 
III observations being assigned an integer value of one. A 
second-stage equal variance threshold value is then calcu- 


lated which separates VISCAT II from VISCAT III. 
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With the two-stage separation complete, the indepen- 
dent data set is processed through the governing equations 
and thresholds to obtain a set of observed visibility value 
results versus calculated "forecast" visibility value re- 
sults. These results, in contingency table format for each 
evaluated case, are presented in Chapter V and Appendix G. 

4. Quadratic Threshold Model (QUAD) 

This model represents the second of two threshold 
models, developed by Lowe (1984a), which were evaluated in 
this study. The model uses an algorithm which requires the 
assumption that both the variances and the means of two 
normally distributed populations, which are to be separated 
by a threshold value, are equal. Similar to the EVAR model, 
a detailed discussion of the theoretical background of this 
scheme is addressed in Appendix A. 

The general two-stage separation procedure employed 
with this model is identical to that described for the EVAR 
model in IV.C. above. The only difference between the QUAD 
and EVAR model is the algorithm, based upon a solution to 
a quadratic equation in this model, used to calcullave@ege 
appropriate threshold values. 

5. Maximum-Likelihood-of-Detection Model 

The maximum-likelihood-of-detection criteria (MLDC) 
is an additional threshold technique which is included in 
this study as a possible alternative to the aforementioned 


EVAR and QUAD minimum probable error threshold models. The 
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MLDC involves calculating a threshold value based upon the 
assumptions that the population frequencies and variances 

of two normally distributed samples which are to be separated 
are identical. This technigue is particularly well suited 
for cases where the threat frequency (1.e., number of 
Beeeacening events divided by the total number of threat and 
non-threat events) approaches very small values (e.g., 
Statistical rare events). 

Unlike the EVAR and QUAD models, the two-stage 
separation employed with this technique utilizes a first- 
meee VviSCAT I+1ll versus VISCAT III followed by a second- 
BeaGe VISCAT I versus VISCAT II separation. In calculating 
Eae Specific threshold values, the lowest frequency visibility 
category (usually the VISCAT I threat category) is assigned 
an arbitrary integer value of one. The remaining larger 
Visibility category/ies are assigned the arbitrary integer 
Mewue Of zero. Proceeding in the same manner as described 
with the EVAR and QUAD models, population means are calcu- 
lated for each separation stage. The threshold value is 
Simply the mid-point between the two population means. A 
detailed discussion of the theoretical background of this 


scheme is addressed in Appendix A. 
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Ve RESULTS 


The general procedures outlined in Chapter I were £fol-= 
lowed in evaluating the statistical scoring techniques for 
the oceanic homogeneous areas 2, 3W and 4. Certain siiloite 
modifications were required to handle the relatively low 
frequency of visibility category I, in area 4 for the TAU- 
00, TAU-24 and TAU-48 model output data sets. Fig. 2 
displays the individual oceanic homogeneous areas for FATJUNE 
1983. Tables I through III identify the frequency of occurs 
rence of visibility categories I, II and Iii at TAU_og 
TAU-24 and TAU-48 for each of the evaluated homogeneous 
areas. 

In discussing the results of this study, specific comment 
1s focused upon the optimal model for each case as well as 
any Significant finding observed by the author. Certain 
characteristics of the evaluated cases are repebaneiene and 
are considered adequately described by their associated 
figures. Consequently, the entire assemblage of figures in 
Appendix G are not individually addressed. These figures 
are nevertheless considered noteworthy, as they document the 
performance of each tested model in this study, and are 
included as a matter of record. The following presentation 
of the results of this experimentation are arranged accord- 


ing to the specific oceanic homogeneous area and model 


Gut put period: 
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DMmeroicoati tour models) are evaluated for each of the 
pecacfined hemogeneous areas/model forecast projections. The 
four models are: the Preisendorfer methodology utilizing 
NOGAPS model output predictors and a limited number of 
derived predictors (PR), the Preisendorfer methodology uti- 
mezang both NOGAPS model output predictors, derived predic- 
tors and linear regression equation predictors (PR+BMD), an 
equal variance linear regression threshold model (EVAR) and 


a quadratic linear regression threshold model (QUAD). 


A. NORTH ATLANTIC OCEAN, AREA 2 

Area 2 encompasses a geographic region that extends from 
the southeastern tip of Newfoundland, across the North 
Atlantic Ocean to the eastern coast of England, north to 
the Five Fingers of Iceland and back to the Canadian coast 
north of Newfoundland. Fig. 2 gives the pictorial repre- 
sentation of the area. 

1. Area 2, TAU-O0O 

Fig. 3 shows the relationship of equally populous 

grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR model. For this case, a grouping size of eight was se- 
lected. Results of the individual MAXPROB I, MAXPROB II 
and Natural Regression strategies are shown in Figs. 4a 
though 4c. The MAXPROB II strategy (Fig. 4b) produced the 


Wargest overall independent data VISCAT I adjusted threat 
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score, namely 0.23 (unadjusted, 0.30). This peak thpeaet 
score occurs with the inclusion of the first preamecem 
E850, and declines marginally with the addition of the re- 
maining four predictors. Of the three strategies, the 
natural regression strategy (Fig. 4c), yields the poorest 
overall threat scores with its peak threat score occurring 
with the addition of the fourth predictor. The predictors 
selected for this case are E850, ENTR, DVDP, U1O0O0O, and 
Suis 

The associated functional dependence and AO/Al sta- 
tistics and 96%/05% confidence interval values for these 
predictors are shown in Fig. 5. The trend of functional 
dependence versus its 96% confidence interval shows that the 
specific functional dependence values associated with the 
chosen predictors never falls within the 96% confidence 
interval. At the first predictor level, for example, the 
functional dependence of ENTR upon E850 has a value of 0.1146 
as compared to a 96% confidence interval value of 0.1039. 
This infers that the corresponding scores (1 ese ae 
scores, AQ and Al) are not statistically Significantyaeged. 
preselected 96% confidence interval level. 

Fig. 6 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. For this case an equally populous grouping 


size of seven was selected. Results of the three wimaiy meme 
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Preisendorfer strategies, along with the corresponding con- 
tingency tables, can be seen in Figs. 7a through 7c. As 
with the PR model, a maximum independent VISCAT I threat 
score was obtained with the MAXPROB II strategy using the 
first predictor selected, namely the linear regression equa- 
meee predictor BMD] (Appendix D). The overall independent 
adjusted VISCAT I threat score achieved with this model is 
fez (unadjusted, 0.36), which is .06 greater than that for 
the PR model. The natural regression strategy (Fig. 7c) 
Provides the poorest resultant threat scores and these reach 
their peak with the inclusion of the fifth predictor. The 
Paedtctors selected for this case are BMD1, ENTR, DVDP, 

PS and PBLD. 

The functional dependence, A1/AO statistics and 963%/ 
05% confidence interval values for this model can be seen in 
Fig. 8. As with the PR model, the specific functional 
dependence values associated with the selected predictors 
never fall below the calculated 96% functional dependence 
confidence interval. 

Figs. 9 and 10 show the contingency tables results 
for the EVAR and QUAD threshold models. For each of these 
models the independent adjusted VISCAT I threat scores have 
identical values of 0.32 (unadjusted, 0.38). 

The two-stage linear regression sequence employed 
for both of these threshold models yields very similar basic 


Statistics. For the EVAR model, a threshold value of 
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0.648497 was calculated for the first-stage VISCAT I versus 
VISCAT II+III separation. This threshold was based upon a 
VISCAT I sample size of 190 observations, a mean of 0.659 
and standard deviation of 0.205 and a combined VISCAT II+III 
sample size of 1722 observations, a mean of 0.927 and 
standard deviation of 0.122. The second-stage VISCAT II 
versus VISCAT III separation was based on a calculated 
threshold of 0.580128. Associated with this threshold value 
were 311 VISCAT II observations with a mean of 0.708 and 
standard deviation of 0.142 and 1473 VISCAT III observations 
with a mean of 0.850 and standard deviation of 0.131. 

For the QUAD model, a threshold value of 0.642104 
was calculated for the VISCAT I versus VISCAT II+III first- 
Stage separation, based upon the sample addressed above. A 
second-stage quadratic threshold separating the VISCAT II 
and VISCAT III samples was calculated to be 0.580569. fThis 
VISCAT II sample contained 358 observations with a mean of 
0.643 and standard deviation of 0.142 while the VISCAT III 
Sample contained 1402 observations with a mean of 0.846 and 
Standard deviation of 0.140. 

While no significant difference appears to exist 
between the results of the two threshold models, the QUAD 
model yields a slightly higher AO and slightly lower Al 
values for both the dependent and independent data sets. 

Table V shows a synopsis of the key statistical 


results for this case. The best models, as determined by 
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independent adjusted VISCAT I threat scores, are the two 
threshold models. Of these two models, the QUAD model 
achieves the highest adjusted AO, namely 3.16% (unadjusted, 
oO. 73%). 
2. Area 2, TAU-24 

Fig. ll shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR model. For this model, adjusted dependent AO values of 
-0.03 and adjusted independent threat score of -.01 were 
obtained for grouping sizes four through nine. At the 
grouping size of ten, a jump in scores was realized and 
thus ten is identified as the only possible selection. An 
associated difficulty in utilizing a grouping size of eight, 
nine or ten, is that local computer storage resources are 
limited to two megabytes. This decreases the usual five 
predictor array to only four predictors as witnessed in this 
case. The results of the three Preisendorfer strategies are 
shown in Figs. l2a through 1l2c. For this model, the MAXPROB 
I and MAXPROB II strategies yield identical maximum independent 
adjusted VISCAT I threat scores of 0.21 (unadjusted, 0.27). 
For each of the maximum probability strategies, an initial 
mmeedt SCOre Of 0.19 (unadjusted, 0.25) was achieved with 
eimewrirst predictor, E850, solely. The slight increase to 
the overall peak threat score was obtained with the inclusion 


of the second predictor, ENTR, with subsequent independent 
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VISCAT I threat scores decreasing at the third wand eteuman 
predictor levels. Of the three strategies, natural regression 
(Fig. l2c) yielded the poorest overall threat score and per= 
cent correct values. These relative peak scores for the 
natural regression strategy occur with the inclusionsor eee 
fourth and final predictor. The predictors selected for this 
model were: E850, ENTR, DVDP and DIV925. 

The associated functional dependence, AO/Al1 statis- 
tics and 96%/05% confidence intervals for this model are 
Shown in Fig. 13. For this case, the third and fourth pre- 
dictors' root-sum-square functional dependence values exceed 
the associated 96% confidence interval values, indicating 
Significant statistical interdependence of these predictors 
at this confidence interval level. 

Fig. 14 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. The dramatic increase in independent threat 
score at grouping size of seven identifies it as the optimal 
selection. The results of the three Preisendorfer strategies 
are shown in Figs. 15a through 15c. For this model, the 
MAXPROB I and MAXPROB II strategies yield identical maximum 
independent adjusted VISCAT I threat scores of 0.26 (unad- 
justed, 0.32). This peak score was achieved with the inclu- 
sion of the first predictor. In this case, the first seveqees 


predictor is the second generated linear regression equation 
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ec rcecommmeiee (Appendix DBD). Following the initial threat 
score maxima, the scores decreased with the addition of the 
subsequent four predictors. While some fluctuation in the 
threat score trend was observed with the MAXPROB II strategy, 
independent VISCAT I threat scores never surpassed their 
initial maximum value. Of the three strategies, natural 
regression (Fig. 15c) provides the poorest overall indepen- 
dent VISCAT I threat score of 0.21 (unadjusted, 0.28). This 
score was achieved with the addition of the fifth and final 
predictor. The predictors selected for this model were: 
meoe2, VRT9I2Z5, ENTR, UlLOOO and RH. 

The associated functional dependence, AO/Al statis- 
tics and 96%/05% confidence intervals are shown in Fig. 16. 
For this model, a comparison of functional dependence and 
functional dependence 96% confidence interval values indi- 
cates that the final three predictors have root-sum-square 
functional dependence values which are too large to ensure 
Significant statistical independence at the 96% confidence 
interval level. 

Figs. 17 and 18 show the contingency tables and 
associated statistics for the EVAR and QUAD threshold models. 
For each of the models, the independent adjusted VISCAT I 
threat scores have identical values, namely 0.29 (unadjusted, 
0.24). The two-stage linear regression sequence employed 
For both of these models yields fairly similar statistical 


results. For the EVAR model, a threshold value of 0.674932 
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was calculated for the first-stage VISCAT I versus VISCAT 
II+III separation based upon a VISCAT I sample size of 180, 
a mean of 0.682 and a standard deviation of 0.227 and a 
VISCAT II+III sample size of 1580, a mean of 0.938 anda 
standard deviation of 0.109. The second-stage VISCAT II 
versus VISCAT III separation was based upon a calculated 
threshold value of 0.601717. Associated with this threshold 
were 300 VISCAT II observations with a mean of 0.733 and 
standard deviation of 0.149 and 1339 VISCAT III observatiane 
with a mean of 0.857 and standard deviation of 0.121. 

For the QUAD model, a threshold value of 0.675210 
was calculated for the first-stage VISCAT I versus VISCAT 
II+III separation based upon the sample statistics addressed 
above. The second-stage threshold separating the VISCAT II 
and VISCAT III samples was calculated to be 0.617455. The 
VISCAT II sample contained 300 observations with a mean of 
0.739 and a standard deviation of 0.125. The Vises awe 
sample contained 1339 observations with a mean of 0.885 and 
standard deviation of 0.118. 

While the VISCAT I threat scores for both the dependent 
and independent data sets are identical for the two models, 
differences in other statistics are apparent. The EVAR model 
(Fig. 17), for example, has the higher independent adjusted 
AO scores, namely 2.96% (unadjusted, 81.34%), as compared to 
scores of -63.31% (unadjusted, 68.60%) for the QUAD model 


CEaEC cule 
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Mime lemon remwecivead 2, TAU-24, the threshold «models 
again provide the highest independent VISCAT I threat scores 
(Table Vv). Of the two threshold models, the EVAR model has 
a slight edge in AO scores. 

3. Area 2, TAU-48 

Fig. 19 shows the relationship of equally populous 
Peouping Size tO the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR model. For this model, the initial peak values of dependent 
aumanmca Imaecpendent VISCAT I threat score at thesgrouping 
eeeze Of four did not sufficiently ascertain four as the 
optimal grouping selection. For this grouping size, the 
second selected predictor ENTR had a functional dependence 
Oe. 2952 as compared to the calculated functional dependence 
96% confidence interval value of 0.1932. The large dis- 
parity between the two functional dependence values indicates 
a Significant statistical correlation between E850 and ENTR 
at a grouping size of four and thus grouping size four was 
meeppea from consideration. The selected grouping size of 
nine, which unfortunately carries with it the requirement 
of a very large computer storage forecast array at the fifth 
predictor level, had a functional dependence value 0.0930 
es compared to a functional dependence 96% confidence interval 
value of 0.0970 and thus was selected as the optimal grouping 
Size. The associated functional dependence, AO/Al statistics 


mee 26% Confidence intervals are shown in Fig. 20. The first 
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three predictors selected have functional dependence values 
sufficiently low enough to ensure no significant predictor 
interdependence. 

The results of the three Preisendorfer strategies 
are shown in Figs. 2la through 2lc. The maximum independent 
VISCAT I threat score achieved for the three strategies was 
0.17 (unadjusted, 0.26) and was obtained with the MAXPROB II 
strategy with the addition of the fifth predictor. It should 
be noted that the independent adjusted VISCAT (etarea. 
scores achieved by both the MAXPROB I and MAXPROB II strate- 
gies reached near peak values of 0.16 (unadjusted 7 ™Gm as 
with the addition of the second predictor, thus greatly mie 
mizing the size of the associated forecast array. Of the 
three strategies, natural regression (Fig. 2lc) yielded the 
poorest overall adjusted independent VISCAT I threat score, 
namely 0.09 (unadjusted, 0.18). This score was achieved 
with the inclusion of the fourth predictor in the forecast 
array. The predictors selected for this model were E850, 
ENTR, DVDP; DRAG and BiyoZs- 

Fig. 22 shows the relationship of equally populeue 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) ienmere 
PR+BMD model. For this model a grouping size of nine was 
selected. The results of the MAXPROB I, MAXPROB II and 
natural regression strategies are shown in Figs. 23a through 


23c. For this model, MAXPROB I and MAXPROB II provide 
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identical maximum independent adjusted VISCAT I threat scores 
of 0.31 (unadjusted, 0.37). These scores were achieved with 
the inclusion of the second linear regression equation pre- 
dictor BMD2 (Appendix D). For each of these strategies, the 
independent VISCAT I threat scores decrease with the addi- 
tion of the second and subsequent predictors. While a slight 
upward progression is noticed with the MAXPROB II strategy, 
the peak score observed at the first predictor level is 

never surpassed. Of the three Preisendorfer strategies, 
natural regression (Fig. 23c), yields the poorest overall 
independent VISCAT I threat score, namely 0.18 (unadjusted, 
Meeeo). This score occurs with the inclusion of the fifth 
predictor and culminates in a slow increase in threat score 
as each predictor is sequentially added to the forecast array. 
The predictors selected for this model were BMD2, VRT925, 
ENTR, U500 and DRAG. 

Fig. 24 shows the functional dependence, AO/A1l sta- 
tistics and 96%/05% confidence interval values for the 
selected predictors. For this model, the second and third 
predictors' functional dependence values fall below the 96% 
confidence interval and thus are not Significantly inter- 
dependent upon one another. This trend changes with the 
fourth and fifth predictors which have functional dependence 
values greater than the calculated 96% confidence interval 
values. 

Figs. 25 and 26 show the contingency table results 


for the EVAR and QUAD threshold models. For each of these 
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models, the independent adjusted VISCAT I threat scores 
have identical values of 0.21 (unadjusted, 0.29). 

The two-stage linear regression sequence used to 
separate the three visibility categories yield very similar 
results for the two threshold models. For the EVAR model, 

a threshold value of 0.652554 was calculated for the first= 
stage VISCAT I versus VISCAT II+III sample separation. 

This threshold value is based upon a VISCAT I sample size 
of 182 observations with a mean of 0.686 and a standard 
deviation of 0.267 and a combined VISCAT II+III sample of 
1670 observations with an associated mean of 0.930 and 
Standard deviation of 0.106. The second stage VISCAT II 
versus VISCAT III regression separation yielded a threshold 
value of 0.572257 based upon 355 ViISGar an obser varehienen with 
a mean of 0.711 and standard deviation 0.135, and 1408 
VISCAT III observations with a mean of 0.834 and a standard 
deviation, of OSS 0- 

For the QUAD model, a very similar threshold value 
of 0.652554 was calculated for the first-stage VISCAT I 
versus VISCAT II+III separation based upon the sample first- 
stage statistics addressed above. A second-stage threshold 
value of 0.564579 was calculated based upon 330 VISCAT If 
observations with a mean of 0.724 and standard deviation of 
0.128, and 1407 VISCAT III observations with a mean of 0.833 


and a Standard deviation of Ue 
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In general, the results of these two threshold models 
are nearly identical. The EVAR model shows a very slight 
advantage in adjusted independent AO scores, namely 7.07% 
(unadusted, 80.11%) as compared to 5.05% (unadjusted, 

79.68%) for the QUAD model. Similarly, the EVAR model yielded 
a slightly higher independent adjusted threat score for VISCAT 
mecombined with VISCAT II of 0.02 (unadjusted, 0.23) versus 

an adjusted score of 0.01 (unadjusted, 0.22) for the QUAD 
model. 

For Area 2, TAU-48 the PR+BMD model provides the 
highest overall independent VISCAT I threat score (Table V). 
The difference between the independent adjusted VISCAT I 
threat scores for the PR+BMD model and the two threshold 
models iS minimal, namely 0.02, while the PR model is 0.14 


lower. 


B. NORTH ATLANTIC OCEAN, AREA 3W 

Area 3W was the North Atlantic homogeneous area selected 
by Karl (1984) for his initial TAU-00 MOS experimentation. 
This area borders the United State's eastern seaboard from 
the vicinity of Cape Charles, Virginia to the southeastern 
tip of Newfoundland. The area encompasses a large portion 
of the Georges Banks region and extends to approximately 
45° W longitude. The specific detail and proximity of this 
area can be seen in Fig. 2. 

Area 3W constitutes the homogeneous area with the highest 


relative frequency of VISCAT I observations with approximately 


Bal 


19% of the total number of visibility observations being less 
than 2 kilometers in the TAU-00, TAU-24 and TAU-48 periods. 
The TAU-24 and TAU-48 prognostic periods will be addressed 
in this document. The reader is advised to consult Karl 
(1984) for detailed information concerning area 3W, TAU-O0O. 
l. Area 3W, TAU-24 

Fig. 27 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR model. For this case a grouping size of six was selected. 
Results of the three Preisendorfer strategies are shown in 
Figs. 28a through 28c. The MAXPROB II strategy achieves a 
slightly higher independent adjusted VISCAT I threat score 
of 0.21 (unadjusted, 0.36) as compared to a score of 0.20 
(unadjusted, 0.35) for the MAXPROB I wicEnoe For each of 
these strategies, the maximum threat score is reached with 
the inclusion of the fifth and final predictor in tEhemonae 
cast array. The general trend of these two strategies is 
nearly identical and show an initial rise in threat score 
at the first predictor level, a slight decrease with the 
addition of the second and third predictors and a secondary 
rise at the fourth and fifth predictor levels. The poorest 
results for this case were achieved with the natural regres- 
sion strategy (Fig. 28c), for which an independent adjusted 
VISCAT I threat score of 0.16 (unadjusted, 0.32) was achieved. 


This score was similarly reached with the addition of the 
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fifth and final predictor. The predictors selected for this 
model were DTDP, SHWRS, ENTR, UlLOOO and DUDP. 

Fig. 29 shows the functional dependence, AOQ/Al 
statistics and 96%/05% confidence intervals for this model. 
For this case only the second predictor has a functional 
dependence value which falls below the corresponding 96% 
confidence interval and thus meets the requisite conditions 
regarding predictor interdependence. Consequently, the 
greatest independent threat score achieved, which coinci- 
dently meets the functional dependence criteria, occurs with 
the MAXPROB II strategy at the inclusion of the second pre- 
@ictor. The threat score achieved in this particular instance 
ieema Value of 0.13 (unadjusted, 0.30). 

Fig. 30 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. For this case a grouping size of five was 
selected. Results of the MAXPROB I, MAXPROB II and natural 
regression strategies are shown in Figs. 3la through 3lc. 
For this model, the two maximum probability strategies pro- 
vide identical peak independent adjusted VISCAT I threat 
scores of 0.28 (unadjusted, 0.42) at the first predictor 
level. For both of these strategies, the addition of subse- 
quent predictors produces a steady drop off in threat score 
values. The poorest overall results for this case are 


achieved with the natural regression strategy (Fig. 3lc). 


53 


This method yields an independent adjusted VISCAT I threat 
score of 0.17 (unadjusted, 0.33) which was obtained with the 
addition of the fifth and final predictor. The predictors 
selected for this model were BMD1, D500, DVDP, ENTR and 
US50. 

Fig. 32 shows the associated functional dependence, 
A1/AO statistics and 96%/05% confidence interval values for 
the predictors chosen for this model. The functional depen- 
dence versus the 96% confidence interval follows a peculiar 
trend where the second predictor is significantly dependent 
upon the first predictor but the third and fourth prediceame 
are conversely sufficiently uncorrelated with the prior 
predictors to ensure no significant functional dependence. 
The final Suecioton returns to being functionally dependent 
upon the previous predictors. This trend indicates that the 
relative contribution of the second and subsequent predictors 
is statistically hot significant at the preselected 96% 
confidence interval level. 

Figs. 33 and 34 show the contingency table results 
for the EVAR and QUAD threshold models. The results of these 
models are very Similar with the EVAR model yielding an 
independent adjusted VISCAT I threat score of 0.17 ({(unad- 
justed, 0.33) as compared to a corresponding Ehreareeecee 
of 0.16 (unadjusted, 0.32) for the QUAD medely 

For the EVAR model, a first-stage threshold value O£f 


0.561855 was calculated based upon 270 VISCAT I observations 
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miei a mean Gt 0.590 and Standard deviation of 0.203 and 
1145 VISCAT II+III observations with a mean of 0.861 and 
standard deviation of 0.168. The second-stage VISCAT II 
versus VISCAT III separation was based upon a calculated 
threshold value of 0.542363. Associated with this threshold 
were 299 VISCAT II observations with a mean of 0.647 and 
standard deviation of 0.146 and 938 VISCAT III observations 
with a mean of 0.794 and standard deviation of 0.153. 

For the QUAD model, a similar threshold value of 
0.5559971 was calculated based upon the first-stage regres- 
Sion separation listed above. A second-stage threshold 
value of 0.540874, separating VISCAT II from VISCAT III, was 
calculated based upon 305 VISCAT II observations with a mean 
Sue Ol ana Standard deviation of 0.157 and 940 VISCAT III 
observations with a mean of 0.793 and standard deviation 
ee 0.154. 

In general, the PR+BMD model produced the best overall 
results for this case, followed by the PR model and lastly 
the two threshold models (Table V). The independent adjusted 
AO score of the PR+BMD model, which corresponds to the maxi- 
mum independent adjusted VISCAT I threat score, is similarly 
a maximum value for this case, namely 21.68% (unadjusted, 
ror. 96%) . 

2. Area 3W, TAU-48 
Fig. 35 shows the relationship of equally populous 


grouping size to the adjusted AO (dependent data) and the 
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adjusted VISCAT I threat score (independent data) for the PR 
model. For this model, an equally populous grouping siZomem 
Six was selected. The results of the three Preisendorfer 
Strategies are shown in Figs. 36a through 36c. For this 
case, the MAXPROB II strategy achieves the highest indepen- 
dent adjusted VISCAT I threat score of 0.18 (unadjusted, 
0.33) as compared to 0.1/7 (unadjusted, 0.32) for the MAxXBRgs 
I strategy and 0.12 (unadjusted, 0.22) for natural regression. 
The maximum score for each of the three methods was achieved 
with the addition of the fifth and final predictor, ai 
statistical score trends for the two maximum probability 
strategies are very similar and reach identical near peak 
independent VISCAT I threat scores of 0.15 (unadjusted, 0.30) 
at the first predictor level. This is particularly note- 
worthy when considering that the computer forecast earmag 

Size may be of Significant operational concern. The poorest 
strategy for this case 1s natural regression. The predictors 
selected for this case are DTDP, SHWRS, ENTR, U850 a 

Diy O22: 

Fig. 37 shows the functional dependence, AQ/Al sta-— 
tistics and 96%/05% confidence interval values for this 
model. In this case, only the second predictor strictly 
meets the requisite functional dependence criteria ensuring 
no Significant dependence of one predictor upon another. The 
MAXPROB II independent adjusted AO score, which corresponds 


to the peak independent VISCAT I threat score for this case, 
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is -2.47% (unadjusted, 66.49%) as compared to AO scores of 
4.94% (unadjusted, 68.91%) for MAXPROB I and -16.87% (unad- 
fmsted, 61.78%) for natural regression. 

Fig. 38 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. For this case a grouping size of five was 
selected. The results of the three Preisendorfer strategies 
are shown in Figs. 39a through 39c. For this case, the 
MAXPROB II strategy provides the highest independent ad- 
justed VISCAT I threat score of 0.30 (unadjusted, 0.43). 
This peak score slightly surpasses the score of 0.29 
(unadjusted, 0.42) achieved by the MAXPROB I method. The 
trends for these two strategies are nearly identical, showing 
only a slight oscillation in independent threat scores as 
predictors are added. The peak score achieved by the MAXPROB 
II scheme is at the fifth predictor level while the peak 
value for MAXPROB I is obtained with the inclusion of the 
first predictor. It should be noted that the results at 
the first predictor level for the two eaten jonmelavevar LIE abe’: 
SetLategies are identical. A forecast array predicated upon 
a one predictor versus five predictor array size requires 
four orders of magnitude less computer storage resources and 
is therefore a desirable characteristic for an operational 
Forecast system. Additionally, the independent adjusted 


AO scores, achieved by both schemes, have identical maximum 
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values of 19.25% (unadjusted, 73.76%) at the first preadvecem 
level as compared to a maximum value of 12.76% (unadjusted, 
71.47%) for the natural regression strategy at the fifth 
predictor level. The poorest strategy for this case is 
natural regression (Fig. 39c). The independent VISCAT I 
threat scores for this scheme initially yield very low threat 
score values at the first and second predictor levels with a 
subsequent rapid rise at the third, fourth and fifth predic= 
tor levels. This rapid rise however produces a threat score 
value of only 0.19 (unadjusted, 0.34) and a corresponding AO 
of -1.65% (unadjusted, 66.76%) at the fifth and final pred@ueae 
level. The predictors selected for this model are BMD2, 
U1L000, ENTR, DVDP and EAIR. 

Fig. 40 shows the functional dependence, AO/Al1 sta- 
tistics and 963%/05% confidence interval values for this model. 
For this case, three of the five selected predictors do noe 
meet the 96% confidence interval criteria for functional 
independence. This further justifies the use of a single 
predictor forecast array for possible operational use. 

Figs. 41 and 42 show the contingency table results 
for the EVAR and QUAD threshold models. The results of these 
two models are very similar with the EVAR model showing a 
slight advantage in independent adjusted VISCAT I threat 
score of 0.15 (unadjusted, 0.33) versus 0-14 (unadjuscem 
0.31) for the QUAD model. Similarly, the EVAR model achieves 


a slightly higher independent adjusted AO of 13.17% 
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(unadjusted, 71.60%) versus 12.76% (unadjusted, 71.47%) 
for the QUAD model. 

For the EVAR model, a first-stage regression thres- 
hold value of 0.577452 was calculated based upon a VISCAT I 
sample size of 290 observations with a mean of 0.620 and 
Standard deviation of 0.211 and a combined VISCAT II+III 
sample size of 1197 observations with a mean of 0.860 and 
standard deviation of 0.153. A second-stage threshold value 
of 0.548587 separating VISCAT II and VISCAT III was calcu- 
lated based upon a VISCAT II sample size of 328 with a mean 
@ee0.654 and standard deviation of 0.142 and 971 VISCAT III 
observations with a mean of 0.777 and standard deviation of 
weet 36. 

The first-stage threshold value of 0.572592 for the 
QUAD model was generated with the above VISCAT I versus 
VISCAT II+III sample statistics. A second-stage threshold 
value of 0.548717 was based upon 333 VISCAT II observations 
with a mean of 0.649 and standard deviation of 0.138. 

In general, the model which produces the highest 
independent VISCAT I threat score for this case is the 
PR+BMD model while the highest independent AO score is 
achieved with the EVAR threshold model (Table V). The rela- 
tively large independent threat score dominates the scores 
however, and therefore the PR+BMD model is determined to be 


the optimal model in this case. 
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C. NORTH ATLANTIC OCEAN, AREA 4 

Area 4 was selected for evaluation because of its rela- 
tively low frequency (approximately 3% of tEheerotami me. 

VISCAT I observations. It was hoped that this area would 
statistically represent a region where there was an insuffi- 
cient number of VISCAT I observations to allow for sStudyaee 
a forecast region where results were anticipated to be poor, 
yet enough VISCAT I observations to avoid any "rare event" 
statistical entanglements. 

This area encompasses a broad region of the North Atlantic 
Ocean which is generally to the south of area 2 and east and 
southeast of area 3W. Area 4's southern border reaches to 
the northeastern tip of Portugal and extends northward through 
the English Channel to encompass the southern portion oleae 
NOrtn Sea. 

1. Area 4, TAU-0O0 

Fig. 43 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) itor sci. 

PR model. Several unique characteristics were encountered 

for this case which had not been previously been observed. 

The previously observed variation of dependent AO and inde- 
pendent threat scores, associated with the sequential varia- 
tion in grouping size from four through ten, was not initia 
achieved. For this case, non-zero values of dependent AO 


and independent VISCAT I threat score were only achieved 
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Paper enNmee tera tlons er the predictor selection procedure. 
he grouping size of four was deleted from consideration 
in the third iteration because the associated AO value, 
achieved at that predictor level, did not exceed the previous 
AO value at the second predictor level. For this case, the 
independent VISCAT I threat scores maintained indentically 
low values, while a relative peak in AO was achieved at a 
@eeouping size of eight. For this reason, eight was selected 
as the optimal grouping size for this model. 

Figs. 44a through 44c represent the results of the 
three Preisendorfer strategies. For each of the schemes, 
the independent VISCAT I threat scores at the first three 
predictor levels reveal the near-zero scores encountered in 
the grouping size selection process. The highest independent 
Pempi@oced VISCAT I threat score, namely 0.08 (unadjusted, 
O.11) is achieved with the MAXPROB II strategy at the fifth 
and final predictor level. For this model, the MAXPROB I 
p@eenatural regression strategies yield only slightly inferior, 
feemeical independent adjusted threat scores of 0.04 (inad- 
jJusted, 0.07) which are achieved at the fifth predictor 
level. The MAXPROB I strategy yields the highest independent 
eempuisted AD score of -15.77% (unadjusted, 82.45%) as compared 
memscores Of -28.63%3 (unadjusted, 80.50%) for natural 
regression and -34.85% (unadjusted, 79.56%) for the MAXPROB 
II strategy. The predictors selected for this model are 


me=00, DVDP, STRITH, E500 and ENTR. 


61 


Fig. 45 shows the functional dependence, AO/A1 sta- 
tistics and 96%/05% confidence interval values for this model. 
In this particular case, only the third predi¢@cor sarc pe 
a functional dependence value less than the 96% confidence 
interval value. This renders the threat scores achieved by 
this model, beyond the first predictor level, statistically 
not significant, if strict adherence to the basic functional 
dependence criteria is followed. 

Fig. 46 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. For this case a grouping size of nine was 
selected. The results of the three Preisendorfer strategies 
are shown in Figs. 47a through 47c. Generally, the results 
for this model differ very little from the previously dis- 
cussed PR model. This case reflects the first and only 
occurrence where the Preisendorfer methodology coupled with 
linear regression equation predictors (PR+BMD model) did not 
yield superior results to the PR model. The trends for 
these three strategies are generally quite similar. The 
MAXPROB II scheme provides the highest independent adjusted 
VISCAT I threat score of 0.09 (unadjusted, 0.11), as eon 
pared to scores of 0.08 (unadjusted, 0.11) for MAXPROB I 
and 0.07 (unadjusted, 0.10) for natural regression. For veal 
of these three strategies, the maximum independent VISCAT I 


threat score was achieved with the inclusion of theese 
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and final predictor. The independent AO scores associated 
with the peak threat scores are near their lowest values at 
the fifth predictor level with the MAXPROB I scheme yielding 
the highest relative independent adjusted AO of -19.09% 
(unadjusted, 81.95%) followed by natural regression with a 
score of -26.56% (unadjusted, 80.82%) and lastly MAXPROB II 
with a score of -39.42% (unadjusted, 78.87%). The predictors 
selected for this model are BMD2, DUDP, ENTR, DEDP and 

Bm2O00. 

Fig. 48 shows the functional dependence, AO/Al sta- 
tistics and 96%/05% confidence interval values for this 
model. Generally, the relative difference between the func- 
tional dependence and 96% functional dependence confidence 
interval values is much less severe than with the previously 
discussed model. While only the third predictor's functional 
dependence value meets the 96% confidence interval criteria 
for significance, the other predictors are only marginally 
masignificant. 

The application of the EVAR and QUAD threshold models 
to this case presented results which had not been previously 
encountered. The first-stage VISCAT I versus VISCAT II+III 
separation calculation results in a QUAD threshold value 
which is imaginary and an unrealistic EVAR threshold value 
of 209.588882. These thresholds were calculated based upon 
a VISCAT I sample size of 85 observations with a mean of 


-1.012 and standard deviation of 6.280 and a combined VISCAT 
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II+III sample size of 3096 observations with a mean of 
-1.864 and standard deviation of 7.092. "These 7ecut ea 
linked to the preponderance of VISCAT III observations in 
this area coupled with the fact that these employed threshold 
models are designed to provide for a minimum error when 
separating samples. These results indicate that a forecast 
model predicated upon the dependent data set employed in this 
case would strictly forecast VISCAT III. 

2. Area 4, TAU-24 

Fig. 49 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR model. This case required three iterations of the four 
through ten grouping size calculations before any non-zero 
dependent AO or independent VISCAT I threat score values were 
achieved. Additionally, for the grouping size of foun, jae 
increase in AO was observed at the second predictor level 
and therefore was deleted from consideration. A grouping 
size of five was ultimately selected for this model. 

Figs. 50a through 50c represent the results of the 
three Preisendorfer strategies. Generally, the independent 
VISCAT I threat scores yielded for these schemes are poor 
with the highest independent adjusted VISCAT I threat seem 
of 0.05 (unadjusted, 0.07) being achieved by the MAXPROB II 
strategy at the fifth predictor level followed by MAXPRGE Rs 


with a score of 0.02 (unadjusted, 0.05) and natural regrestiman 
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with 0.01 (unadjusted, 0.04). The AO scores corresponding 
to these values provide for a slightly different scoring 
hierarchy. The highest independent adjusted AO score, 
namely 0.94% (unadjusted, 85.64%), is attained by the 
MAXPROB I strategy as compared to scores of -30.05% (unad- 
masted, 81.34%) for natural regression and -37.56% (unad- 
jJusted, 80.05%) for MAXPROB II. The predictors selected for 
this model are VRT925, DTDP, ENTR, V850 and DVRTDP. 

Fig. 51 shows the functional dependence, AO/A1 sta- 
tistics and 96%/05% confidence interval values for this 
model. For this case, each predictor following VRT925 proved 
to be significantly functionally dependent on its predecessors 
and therefore only a single predictor forecast array is 
justifiable for this model. 

Fig. 52 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. AS in the previous case, three iterations of 
dependent AO and independent VISCAT I calculations were 
required before any non-zero scores were achieved. Addi- 
BHonally, in this case, the grouping sizes of four and five 
were deleted from consideration as they did not provide an 
increase of AO at the second predictor levels. The grouping 
Size ultimately selected for this model was nine. 

Figs. 53a through 53c show the results of the three 


Preisendorfer strategies for this model. The scores for 
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this model, as in the previously described case, are quite 
poor and show very little improvement over the PR model. 
The highest independent adjusted VISCAT I threat score, 
namely 0.06 (unadjusted, 0.09), was achieved by the MAXPROB 
II strategy followed by scores of 0.05 (unadjusted, 0.07) 
for the MAXPROB I strategy and 0.03 (unadjusted, 0.06) for 
natural regression. The corresponding independent adjusted 
AQ scores show a maximum score of -19.25% (unadjusted, 
82.71%) for the MAXPROB I strategy followed by scores of 
-30.05% (unadjusted, 81.14%) for natural regression and 
-39.91% (unadjusted, 79.71%) for the MAXPROB II Strategye 

Fig. 54 shows the functional dependence, AO/Al1 sta- 
tistics and 96%/05% confidence interval values for this 
model. For this case, the relative magnitude of the differ- 
ence between the actual functional dependence and its 96% 
confidence interval value is quite small. It is only at the 
second predictor level, that the calculated values do not 
exceed the corresponding 96% confidence interval value. 

Fig. 55 shows the contingency table results for the 
EVAR threshold model. The QUAD model provided an imaginary 
threshold value at the second regression stage and therefore 
did not allow completion of the entire separation sequence. 
This represents the only occurrence where a valid equal 
variance threshold was calculated but a corresponding quadratic 


threshold proved to be imaginary. 
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The results of the EVAR model were in keeping with 
those of the previously described PR and PR+BMD models. An 
independent adjusted VISCAT I threat score of 0.05 (unad- 
masted, 0.07), was achieved with a corresponding independent 
payusted AO value of -13.15% (unadjusted, 83.59%). 

The first-stage regression separation for this model 
was based upon a calculated threshold value of 0.908275. 
Associated with this threshold were 449 VISCAT I observations 
with a mean of 0.953 and standard deviation of 0.030 and 
2489 VISCAT II+III observations with a mean of 0.976 and 
standard deviation of 0.027. The second-stage VISCAT II 
versus VISCAT III separation was based upon a calculated 
meresnold value of 0.683569. Associated with this threshold 
is a VISCAT II sample size of 69 observations with a mean of 
Moo! and standard deviation of 0.066 and 887 VISCAT III 
PeserVations with a mean of 0.912 and standard deviation of 
emo 7 8. 

For the QUAD model, an initial first-stage threshold 
value of 0.908275 was successfully calculated with the 
Sample statistics addressed above. The second-stage regres- 
Sion attempt was based upon a VISCAT II sample size of 65 
Seservations with a mean of 0.829 and standard deviation of 
Moo? and VISCAT III sample size of 853 observations with a 
mean of 0.905 and standard deviation of 0.079. These sample 
statistics produced an imaginary threshold value. 

In general Area 4, TAU-24 is characterized by very 


poor independent VISCAT I threat scores. This indicates 
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that there is very little skill in forecasting = ic eee 
conditions of less than or equal to 2 kilometers in this 
area. The evaluated models show little variation in scores 
with the best relative model for this area and forecast 
projection being the PR+BMD model (Table V). 

3. Area 4, TAU-48 . 

Fig. 56 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the PR 
model. As in the TAU-00 and TAU-24 forecast projections for 
this area, the calculation and evaluation of dependent A0 
and independent VISCAT I threat scores had to be run throgee 
three iterations before any non-zero statistics were obtained. 
The grouping sizes of four and five were deleted from con- 
Sideration because the addition of predictors at those grouping 
sizes did not provide for any increase in AQ scores. Based 
on an evaluation of the results as shown on Fig. 56, ‘a group- 
ing size of seven was selected. 

Figs. 57a through 5/7c show the results O£ tie tive 
Preisendorfer strategies for this model. In general, the 
near-zero statistical scores encountered in the grouping 
selection process, can be seen through the third predictor 
level, along with a noticeable increase in scores at the 
fourth and fifth predictor level. The MAXPROB I Strategy 
yields the highest independent adjusted VISCAT I threat 


score, namely 0.18 (unadjusted, 0.20), for this model 
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followed by a natural regression score of 0.16 (unadjusted, 
feo) and lastly by MAXPROB II with a score of 0.13 (unad- 
aiecea, 07 lo) liicmis the first and only encountered case 
where the natural regression strategy effectively achieved a 
maximum independent VISCAT I threat score which is higher 
than either of the two maximum probability strategies. The 
independent AQ scores associated with these peak independent 
VISCAT I threat scores, adhere to this same scoring sequence, 
with MAXPROB I achieving a value of -14.94% (unadjusted, 
81.86%) followed by natural regression with a score of 
-27.80% (unadjusted, 79.83%) and MAXPROB II with an indepen- 
dent adjusted AO of -43.98% (unadjusted, 77.78%). 

Fig. 58 shows the functional dependence, AO/Al sta- 
tistics and 96%/05% confidence interval values for this model. 
In this case, only the third predictor's functional dependence 
value falls below the associated 96% confidence interval 
value. The predictors selected for this model are VRT925, 
Peer DP, ENTR, DUDP and RH. 

Fig. 59 shows the relationship of equally populous 
grouping size to the adjusted AO (dependent data) and the 
adjusted VISCAT I threat score (independent data) for the 
PR+BMD model. As in the previous area 4 cases, three com- 
plete iterations of the four through ten grouping size calcu- 
lations had to be performed before any non-zero dependent 
AO or independent VISCAT I values were achieved. The 


grouping size ultimately selected for this model was nine. 
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Figs. 60a through 60c represent the results of the 
three Preisendorfer strategies. A unique result of wens 
model is that for the first time, the PR+BMD model did not 
achieve independent VISCAT I threat scores which exceed those 
achieved by the PR model. The peak independent adjusted 
VISCAT I threat score of 0.17 (unadjusted, 0.20) is achieved 
by the MAXPROB II strategy at the third predictonm mle, im 
The predictors selected for this model are BMD1, DDVDP, 

DUDP, ENTR and PRECIP. 

Fig. 61 shows the functional dependence, AO/Al sta- 
tistics and 96%/05% confidence interval values for this 
model. Only the second predictor sufficiently meets the 96% 
confidence interval significance criteria. Based upon a 
strict adherence to the preselected 96% confidence interval 
Significance requirements, these functional dependence 
values provide cause for uncertainty in the representative- 
ness of the scores achieved after the second predictor level. 

Figs. 62 and 63 show the contingency table results 
for the EVAR and QUAD threshold models. The results of these 
models are very similar and generally quite poor. The QUAD 
model achieves the highest relative independent adjusted 
VISCAT I threat score of 0.01 (unadjusted, 0.04) as compared 
to a score of -0.01 (unadjusted, 0.02) for the EVAR model® 
The QUAD model similarly achieves the highest independent 
adjusted AO score of -2.07% (unadjusted, 83.89%) versus a 


score of -7.88% (unadjusted, 82.97%) for the EVAR model. 
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For the EVAR model, a first-stage regression thres- 
hold value of 0.847203 was calculated based upon a VISCAT I 
sample size of 109 observations with a mean of 0.918 and 
Standard deviation of 0.052 and 2947 VISCAT II+III observa- 
tions with a mean of 0.967 and standard deviation of 0.037. 
The second-stage VISCAT II versus VISCAT III separation was 
based upon a calculated threshold value of 0.629338. Asso- 
Ciated with this threshold is 495 VISCAT II observations with 
mean Of 0.861 and standard deviation of 0.103. 

For the QUAD model, a first-stage separation thres- 
hold value of 0.847203 was calculated upon the associated 
sample statistics addressed above. A second-stage threshold 
value of 0.613739 was calculated based upon 481 VISCAT II 
observations with a mean of 0.770 and standard deviation of 
0.089 and 2522 VISCAT III observations with a mean of 0.862 
ema standard deviation of 0.100. 

The overall results associated with the area 4, TAU- 
48 case are particularly unique. The independent adjusted 
TAU-48 VISCAT I threat score represents the highest area 4 
macdependent VISCAT I threat score (by a minimum of 0.09) 
achieved, as compared to TAU-00 or TAU-24. The maximum 
independent VISCAT I threat score is achieved by the PR 
model. 

Following the completion of the testing and evaluation 
of the FATJUNE 1983 data set, a series of preliminary experi- 


ments were performed with the May 15 to June 23 1984 data 
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set for the TAU-24 model forecast projection. These experi= 
ments consisted of evaluating the TAU-24, 1983 forecast 

arrays and equations (generated with FATJUNE 1983 data) 

with training and testing cases of TAU-24, 1984 data. In 
performing this evaluation, the 1984 data set was divided 

into "dependent" and "independent" portions. This data 
separation is a function of the specific mechanics of the 
computer programs utilized in this study and is not associated 
with the generation of additional forecast arrays or equations. 
Two homogeneous areas were evaluated, namely area 2 and area 
3W. This essentially provided an independent verification 

of the utility of the 1983 forecast arrays and equations in 
predicting observed 1984 visibility in these areas. 

In general, the skill and contingency table resuilee 
for these experiments compare very favorably to those achieved 
with the FATJUNE 1983 data. A summary of the results of 
each of the evaluated models is provided in Table VII. For 
area 2, a peak independent adjusted VISCAT I threat score 
(1984 data), namely 0.27 (0.33 unadjusted), was acmievea 
with each of the two threshold models. This compares to a 
peak independent adjusted VISCAT I threat score (1983 data) 
of 0.29 (0.34 unadjusted) achieved by the same two modemas 
For area 3W, a peak independent adjusted threat score (1984 
data) of 0.28 (unadjusted, 0.42) similarly Compares meee. 
peak independent adjusted threat score (1983) of 0.13 


(unadjusted, 0.36). 
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The overall results of the 1984 data experiments can 
be seen in Table VII and are represented by Fig. 64 which 
illustrates the results of the PR+BMD model, MAXPROB II 
strategy, for area 2, TAU-24. 

A review of the results associated with area 4 for 
TAU-00, TAU-24 and TAU-48 indicates that none of the models 
evaluated achieved very encouraging skill and threat scores. 
Consequently, the maximum-likelihood-of-detection criteria 
(MLDC) was proposed as an alternative technique to increase 
threat scores in area 4. 

A series of experiments involving an arbitrarily 
selected population of two hundred normally distributed 
events, partitioned into eight separate threat/non-threat 
Samples, were performed to demonstrate the theoretical utility 
of the MLDC at low threat frequencies. Threshold values 
were calculated, for various threat frequencies, using the 
EVAR minimum probable error and MLDC techniques and two by 
two contingency tables were constructed to tabulate the asso- 
Ciated threat score, percent correct and false alarm rate 
results. Fig. 65 shows the resulting plot of threat score 
Versus threat frequency which illustrates the amount of in- 
crease in threat score associated with the MLDC model. Asso- 
Ciated with these higher threat scores are correspondingly 


higher "costs," namely higher false alarm rates, illustrated 
in Fig. 66, and lower percent correct scores, illustrated in 


ree, 67. 


PS 


A set of two experiments was performed, utilizing 
FATJUNE 1983, TAU-24 data and the two-stage separation 
scheme outlined in Chapter IV (MLDC model), to evaluate the 
relative performance of the MLDC and EVAR models on area 4. 
In general, the results of these two experiments were consis-— 
tent with the results predicted by the aforementioned 
theoretical experiments. The most obvious area of agreement 
is the significantly lower independent adjusted VISCAT 7 
and VISCAT II threat scores (both are considered threatening 
events in this study), namely 0.01 (unadjusted, 0.04) and 
-0.14 (unadjusted, 0.00), achieved by the EVAR model, Fig. 
68, as compared to the corresponding scores of 0.04 (unad- 
justed, 0.07) and 0.03 (unadjusted, 0.15) achieved by the 


MLDC model, Fig. 69. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 


fee CONCLUSIONS 

The primary objective of this study was to expand upon 
the initial research and experimentation presented by Karl 
(1984) and to propose a viable statistical forecasting scheme 
Suitable for eventual employment in an operational U.S. Navy 
marine visibility MOS forecasting system. In general, 
while the results of linear regression and the evaluated 
Preisendorfer models are roughly comparable, it has been 
shown that two specific statistical approaches, namely the 
PR+BMD model's MAXPROB II strategy and the linear regression 
models, yield the best results (as measured by independent 
VISCAT I threat score) achieved in this study. The PR+BMD 
model achieved the best results for six of the eight evaluated 
cases: area 2, TAU-48; area 3W, TAU-24 and TAU-48; and area 
4, TAU-00, TAU-24 and TAU-48. The nearly identical results 
of both the equal variance and quadratic linear regression 
threshold models provided the best skill and threat scores 
for area 2, TAU-00 and TAU-24. A common characteristic of 
Gach of the evaluated cases is that the predictability of 
Visibility category II is relatively very poor and nearly 
mevays poorer than that for visibility Categories I or III. 
This pattern affirms the findings of similar Pacific Ocean 


Visibility studies [Renard and Thompson, 1984] as well as 


72> 


those documented by Karl (1984) and further supports Karl's 
recommendation to change from a three-category to a two- 
category visibility forecasting scheme. 

An evaluation of the overall results of this study shows 
that no real connection between individual model/strategy 
and either the homogeneous oceanic area (2, 3W and 4) or 
model output time (TAU-00, TAU-24 and TAU-48) can be made. 
The linear regression threshold models performed best for 
area 2, the intermediate poor visibility oceanic area, while 
the Preisendorfer methodology incorporating linear regression 
equation predictors proved the best in the evaluated homogene- 
ous areas with the greatest and lowest relative concentration 
of poor visibility observations. The trend of visibilia, 
category I skill and threat scores, for each homogeneous area 
and model output time, seems to contradict the preliminary 
Supposition that peak skill scores would be associated with 
the area containing the greatest frequency of poor visiblity 
observations and the TAU-00 model output time. This result 
is most apparent with area 4, where threat scores increase 
with increaSing model forecast projections until they achieve 
values at TAU-48, which are nearly identical to those for the 
other two homogeneous areas. This type of trend in skill 
and threat scores most likely reflects the overall strength 
of the statistical relationships for the predictand/predictors 
involved irrespective of the frequency of specific visibility 


ODSeEvatlrons. 
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In several cases, the maximum independent visiblity 
eateqory =: threat score achieved by the PR+BMD model was 
reached at the first predictor level. In several additional 
cases, threat score values which were only marginally lower 
than peak value, were similarly achieved at the first pre- 
dictor level. Forecasting arrays involving only one predic- 
tor drastically reduce required computer storage and 
consequently such arrays are a desirable attribute to any 
Operational MOS forecasting system. A MOS-type forecasting 
system predicatedupon such a small number of predictors 
would prove extremely beneficial in an independent single 
station forecasting scenario such as that experienced by an 
@rccraft carrier based U.S. Navy Oceanography Officer. 

The concept and practical employment of functional 
dependence, associated with the Preisendorfer methodology, 
provides a greater restriction on the statistical significance 
of the skill and threat score results achieved in this study, 
as compared to that which was previously experienced by 
Karl (1984). It was shown that the calculated functional 
dependence values for each respective predictor or group of 
predictors often exceeded the associated 96% confidence 
interval value at the first or second predictor level and 
rarely met the requirements for significance for an entire 
array of selected predictors. This restriction further 
indicates that any operational forecasting scheme should 
most likely be composed of only a minimal number of select 


mGeedictors. 
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The difference between the equal varlanCe anduguacmeee 
threshold models was shown to be very minimal. The two-stage 
visibility category separation approach is designed to 
handle cases with distinct separability between categories 
while providing for minimal error in the calculated threshold 
values. This condition was not met in the area 4, TAU-00 
and TAU-24 cases and subsequently lead to unrealistic thres- 
hold values. 

The preliminary independent evaluation of the 15 May to 
23 June 1984 data set provided a crucial test and verifica- 
tion of the utility of the forecast arrays and equations 
presented in this study. 

The introduction and initial evaluation of the maximum. 
likelihood-of-detection threshold model offers another 
technique to the pool of visibility puedicticneschones 
This method appears to be most beneficial in areas of low 
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B. RECOMMENDATIONS 

The following recommendations are offered to future 
researchers: 

1. Remove the MAXPROB I and natural regression strategies 
of the Preisendorfer methodology from further consideration 
in the forecasting of marine horizontal Visi vies 

2. Delete one of the two threshold models evaluated 


in this study, and investigate additional thresholding 
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techniques based on the Beta distribution and the maximum- 
likelihood-of-detection criteria. 

3. Revise the current three-category visibility scheme 
to a two-category scheme where visibility categories I and 
II are combined. This should be particularly beneficial in 
those homogeneous areas with extremely low frequencies of 
visibility category I and II observations. 

4, Expand the initial set of potential predictors to 
include air-sea temperature differences as well as additional 
derived predictors such as the advections and gradients of 
meiieerature, vorticity and moisture in order to more fully 
Simulate the physical processes associated with poor marine 
visibility. Additionally, include TAU-00 and TAU-24 model 
Sucput parameter fields as potential predictors in future 
evaluations of TAU-24 and TAU-48 MOS forecasts. 

59. Evaluate OOOOGMT data sets to determine the effect 
PeeiLghnttime conditions on both visibility observations and 
NOGAPS model output parameters. 

6. Investigate new procedures to determine the number 
of equally populous predictor intervals. The following 
procedure [Preisendorfer, 1984] is proposed: 

a. To establish the number of equally populous predic- 
tor intervals for any predictor, consider a bivariate 
predictand/predictor [Preisendorfer, 1983a]. Start with 
m= 1 and find the potential predictability (PP) for the 


resultant plot, call it "PP(1l)." In general, PP(m) is the 


The, 


PP for the general case of m. Successively, find Prim 
for m= 1,2,..., and continue to subidivde the preqrecen 
range as long as PP increases: PP(m) < PP(m+l). Stop at 
PP(m) if PP(m+l) < PP(m) or if PP(m+l) < PP(m+1|96), where 
the later is defined by Preisendorfer (1983a) and denoted 
by "PP(96)." This last condition avoids sparse bivariate data 
plots, caused by too large an m. It was Karl's 1984 experi- 
ence that five to eight equally populous predictor intervals 
are sufficient for all predictors. Hence m, for each pre- 
dictor, is expected to be in this neighborhood. 

b. Order the set of available predictors in descending 
value of potential predictability (PP). Break Cies 7am 
AO (PP and AO are defined by Preisendorfer (1983a)). AO 
is the actual skill, after the prediction has been made. 


c. The first predictor is that with the greatest 


(0) (0) 


PP. Compute associated AO and Al. Call them AO and Al . 
7. Associated with recommendation 6. above, improve 
the predictor selection procedure as follows: 
a. Suppose k-l predictors have been chosen, let 


them be X Let Y be a new predictor candidate. 


peers yy 

Admit Y as the kth predictor if the three following conditions 

are satisfied: 

(1) Functional dependence (Y|X.) < funetionam 
dependence, (Y|X, 705) for i=1,...,k=l 


(i1.e., the functional dependence of xX. 


and Y is not significantly large Wor eae 
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ee ie ind Lunctlional dependence 
(y |X.) Puemnline =Lola == aependence (|X, 705) 


as described by Preisendorfer (1983c)). 


(k) iss 1) (kK) ese) 


emda AL Sen 


(k) 


(2) AO > AO 


ons) Zi (05) 


(3) AO > AO(96) amd Al 
All three conditions must hold for admittance of Y to the 
predictor set. 

b. A less stringent predictor selection process 
would be to form functional dependence (Y.1Xj), where Xr 
1 =1,...,k-1 are the selected predictors, and the Was 
wee, ...,q are the as-yet unselected predictors. Here 


fe!) +q = p, the original number of potential predictors. 


Next, form |min|max functional dependence (v5 [X3) ||. This 
i] a 


fixes that Y. for which functional dependence (Y.|X5) is 
the least possible of the maximum functional dependence 
values over the present X set. This makes the best out of 
the worst case of functional dependence (i.e., select the 
Y. farthest from the set of X.). 

eamcontinue tO repeat step 7a. above until all 
potential predictors are used up (the critical values of 
A0(96) and A1(05) are as defined by Preisendorfer (1983b). 
Another reason for stopping may be that allotted CPU time is 
used up before the predictors. 


8. Investigate a further and more complete verification 


of the forecast arrays and equations presented in this study 
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utilizing available 1984 data sets. Specitically a eee 
the 1984 data set as an entire independent test case without 
first removing a portion of the data for use as a dependent 
forecast array/equation training set. Additionally, gener- 
ate an additional set of forecast arrays and equations 

based on combined 1983 and 1984 FATJUNE data and evaluate 
the statistical stability of the equations as different 


years of data are merged into a larger data base. 
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APPENDIX A 


LINEAR REGRESSION AND THRESHOLD MODELS 


EE te 


A. LINEAR REGRESSION 

The linear regression techniques used in this study 
expand upon and slightly modify those first presented by 
Karl (1984). In this study, two separate least-Squares, 
multiple linear regression software programs; referred to 
as the BMDP2R--Stepwise Regression and BMDP9R--All Possible 
Subsets Regression computer programs in the BMDP Statistical 
Software [University of California, 1983], were used. 

The independent variable selection procedure employed in 
the BMDP2R program is referred to as a forward, step-wise 
selection process where predictors are selected from a large 
group of available potential independent variables based 
upon the highest correlation with the Bean predictand 
Persability in this study). This correlation is calculated 
based upon certain F-to-enter and F-to-remove limits, where 
F is a ratio which tests the significance of the coefficients 
of the predictors in the regression equation. 


The regression model fitted to the data is 


where: 
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y = the dependent variable (predictand) which 
can be either a continuous function enue 
discrete value 


* 
il 


the independent variables (predictors) 


La n 
be. gave = the regression coefficients 
1 n 
a = the intercept 
p = the number of independent variables 
€ = the error with mean zero 


nw 


The predicted value y, and the general form of the resulting 


equation, is 
Y & Cal tebe ee ee box 


The step-wise selection of predictors continues until there 
are no predictors remaining which meet the requisite F-to- 
enter criteria. The regression equation generated by the 
BMDP2R program is outputted at Gach regression step where 
variables are selected as independent predictors, along with 
its corresponding R value (the correlation of dependent 
variable y with the predicted value a and Pe value. The 
resulting equation sets are reviewed, and that equation con- 
taining only those predictors which increased Ro by at least 
0.01 are retained for application. 

The procedure employed with the BMDP9R program varies 
from that of the BMDP2R, in that a "best" possible subset, 
derived independently of variable or variable sequence, is 


calculated from the group of potential predictors. Once this 
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"best" subset is identified, a linear regression equation is 
fitted to the data, based only upon those selected predic- 
tors, in a fashion identical to that for the BMDP2R program. 
The "best" possible subset is calculated by a Furnville- 
Wilson algorithm which provides the user with a variety of 
subordinate subsets in addition to the "best" subset. Three 
criteria are available to define the "best" possible subset 
as a function of independent variables (predictors) anda 
dependent variable (predictand): the sample Rr’, the adjusted 
R’ and Mallow's Cp. For this study, the Mallow's Cp criteria 


mee defined as: 
Cp = RSS/S - (N — 2P') 
where: 


RSS = the residual sum of the squares for the 
new subset being tested 


S = the residual mean square based on the 
linear regression using all independent 
variables 

P' = the number of variables in each subset 

N = the total number of cases 


For this method, "best" is defined as the smallest Cp value. 
Independent variable selection for the BMDP9R program 
begins with a general screening of the entire set of potential 


predictors. Variables which are identified as redundant, 
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linear combinations of other variables, with respect to the 
predictand, in this general screening are deleted from further 
consideration. The t statistics for the coefficients which 
minimize the Cp value for each reviewed subset identifies 

the "best" subset. The number of predictors assigned to each 
subset can be predefined and for this study each subset 
equation was required to have six predictors. 

The role of regression, once appropriate predictor varia- 
bles have been selected, is simply that of dimension reduction 
(representing a multivariate structure by a univariate 
proxy which constitutes a classificatory or predictive 
index). This proxy takes the form of a polynomial, linear 
in its coefficients, of the components of the multivariate 
structure. The problem now becomes one of determining the 
form of the state conditional distributions (one for each 
group of interest; e.g., one, two and three for visibility 
categories I, II and III, as used in this study). Once an 
appropriate form has been selected, it remains, then, to 
determine the parameters of the class conditional distribu- 
tions (e.g., means and variances) and then apply an appro- 


priate decision criterion or threshold mocerl 


B. THRESHOLDS [Lowe, 1984a] 


i Notarion 


E = an event; this is an indicator variable which 
when E = 1, the threatening event occurs, and 
when E = 0, the non-threatening event occurs. 
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Cay = 
eae — | = 
eae —O| = 

[higiene 

igigiene 


mute =1 7 E =0) 
Pre =O nE=1) 
Pe = 1/5 =0) 
P[C =0|E =1] 


© ke See 0) 


mate = 0 n E =1) 


the classification of an unknown event which 
when C = 1, the event is classified as a 
threat, and when C = 0, the event is classi- 
fied as a non-threat. 


unconditional probability of occurrence of 
threat. 


ineonewtienal probability of occurrence of 
NON] EnGeeat. 


of the lst kind (false alarm) [SE ey 1 0 
One eas Lac eileen) hee Oeil). 


Seo Miempmol awe Of an error of the 1st 
Sails 


=Olleepwonambatrty Of an error of the 
PMG Kael 


=sseelass conditional probability of misclassi- 
fying a non-threat. 


eweclass conditional probability of misclassi- 
fying a threat... 


Elter= 5 — 0) Pie = 0) - 


alee hOnewe—elalers f— O ) 


wee ue einen plGedTelive index (equivalent 
Ome ac OMe ).. 
Z = yrange of the predictive index on the real line. 
BOL a dichotomous problem, Z is into two parts Zoe Zi 


time decision 


o.eC., Zo n 2, 


regions are mutually exclusive and exhaustive 


= 0 and Z = Zo uz). 


Oy 


Thresholds boundary(s) between decision regions. 


Clo ieee = class conditional density of z given 
that for 

eZ |e) = class conditional density Of semaine em 
tel clea — one 

A(z) = p(z|E=1)/p(z|E=0) = the maximum likelihood 
ratio ({i.e., the ratio of claSs coOndiciromes 
densities). 

Deeps p{{C=lnE=0] vu {[C =0 nB=1l)) 0] theme 


probability Of -ernom. 


2. Minimum Probability of Error Cri cemieng 


Da. Fe probability of an incorrect classi=@teabuene 

P, = pIC=1)/E=0] plz =O) pie — Oo) reese 
where p{E=1] + p[E=0] = 1. NOte that the events E = l 
and E = 0 are mutually exclusive and exhaustive. The objec- 


tive is to select decision regions (thresholds) so as to 


minimize Pee 


p{(C=0O|E=1) = f p(z|E=1)dz = the probability aem 
zea 
0 
misclassifying E = l. 
pic =0|E=1] = f p{zjE=)idz4 f) pae oes 
Zea Zea, 


- f p(z|E=1)dz 


Zed, 
picH— ole — 15 = ee f PC Zee Vidz these are 
zed, substituted 
into the 
expression 
p(C=1|/E=0] = J D(z ie — oe for p, 
Zea, 
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then, 


eee rial —O)az + pIE=1){1 = f{ p(zl|E =1)4z] 


Zea, Zea, 


and algebraic rearrangement yields, 


fee «=PLE=1) - | ee eer Oliepiz|H=—0) — plE=1) p(z|E=1) @z} 
ZEG 
i 
me order to minimize Por Zs (the decision region for C = l) 


feel include all those values of z for which the integrand 
miethe expression for Po will be negative. The decision 
regions can be symbolically represented as follows: 


eZ oe = Olemiz)! = 0) = plb =1) p(z|E=1) > 0} 


oer oe — 0) ole D(z) =i) < 0} 


N 
I 


An alternative representation is given by, 


Peo Ol par — Oe > ple = di) o(2|E = 1) } 


Sez ely ol — leno (zie — W)/p( zi = 0) } 


Likewise, 


Z, = 7 ee —aeal ob i 7a (7 |e = 0) } 
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These statements can be combined to give, 


19) 
-_) 


ll Ae I 


© 


p(z|E.=1)/p(z | sey eee ez) p[E =0)/ole = 


Q 


Thresholds are the value(s) of z for which 


A(z) “= el =017 > (Ee 


This equation can be solved for z either analytically or 
numerically depending on the forms of the density functions. 
3. Threshold Cases 
In order to examplify the model, the assumption is 
made that the class conditional distributions are Gaussian. 
There are essentially three distinct cases that can arise. 
a. Case I: Equal variances; different means 


(Referred to as the equal variance model (EVAR) 
in the text) 


EZ ee = k exp{(-1/2) (z -u,)*/07} 
Zine 
P(zZ|fh 0) =Skvexpl (= 2az Sage Vie | 
where: 
aie 


expr (=i 2)aez -4)°/07} ee Po 
NiGzZ = ea < oo 
exp{(-1/2)(Z -up) Jo} Q5 1 


30 


Density 


where A is the likelihood ratio and Py = Dile= 0) anda 


Py = p{E =1]. Thus, the threshold value is 
EZ = (ip ate can (6 Vee (ia = te) 
0 i Ga Il 0 
i 
E-0 Ee = 1 


Classification index (z) 


The position of the threshold depends on the relative values 


ar Py and Po- The threshold moves toward the group with the 


smallest P.- ets P1 = Po the threshold will be the value of 


z where the densities intersect (i.e., where the densities 
are equal). 
b. Case II: Equal means; different variances 





o,exp{(-1/2) (z-u,)°/os} SS" pp 
A(z) = Se a er ; an 
oa _ p 
o,exp{(-1/2)(z-ug) /og} Gog 1 
with the threshold 

20604 Py] Ife 

nn \emom. = ecm 

(0,-99) 1°0 


Ol 


Note that in this situation there are two thresholds. The 
group having the smaller variance will lie between the two 


thresholds. 


Density 
SC 
Za 


Classitication inGgexnw.Gz) 


The thresholds shown are typical of a situation where P, < Po: 
Note that these thresholds lie between the two intersections 
of the densities. If the inequality of prior probabilities 
were reversed, the thresholds would lie outside 62 ene 
region between the two density intersections 7 = Furcner eae 
that the decision region for the group having the lesaee 
variance lies between the thresholds. 


c. Case III: General Solution (Referred to -asmeae 
Quadratic Model (QUAD) im@the vee.) 


p(z|E=1) = k/o, exp{(-1/2) (z -u,)*/o%} 


Diz ee 0 or exp{(-1/2) (z - uo)? 40g} 


a2 








C= 
A ae, po Pee , > Po 1 
0 < 
0 1 oe 


0 
poCZ ee | Exe 7 2 [ at 
: Pie 


maere k = ee oe 


Algebraic manipulation produces 


2 2 2 2 Z 
(6) -9)2 io 2(o uy - OH) 2 


22 22 22 
+ [loqUg - 99h) ~ 20907 In (Ppo4/P1 59) | 


which is recognizable as a quadratic equation in Zz. 


tS eb - = 4actee?72a 
where: 
Mee eo ng 
1 0 
b = 2X eo ie) 
Sie 10 


_ DD Meee a2 
c = (oyu, Oph) 207Ho In (P95) /P}%) 


3 





Density 


Classification index (z) 


The remarks given for the figures in cases I and II are weniee 
applicable here. More often than not, only one of a padi 
thresholds induced by differing variances will be of real 
interest. If the variances of the two groups are radically 
different, then both members of the threshold pair become 
INO felts. 


4. The-Maximum-Likelihood-of-Detection Criteria 


For this specific model the following backqroundaaee 


provided: 

event space: 2' mutually exclusive populations 

Tor Ty forecast decision space: 2 possible forecasts 
OMe ail 

do is a correct forecast if T) actually eweeuae 

qd, 1S a COmrect forecast aa: 7) actually occurs 
Problem: select the decision rule d(z) which maps 


the observation space Z into some forecast Space 


in some optimal manner. 
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Z may be an observed variable or it may be an 


Mme nice amclekayea from a number Of variables. 


For this two decision problem, Z is partitioned 


iio EWO PabUs, eawane Z)- 


0 
d(z) = dy ait nl 7 ea Zo 
aez) = ds ey a te Za 
where a q Za =) lige! Zo U Za = Ys 


The maximum-likelihood-of-detection criteria repre- 
sents the simplest decision model. The basic involves 
selecting the forecast (decision) corresponding to the obser- 
vation (signal) which is the most likely symptom of the event 


subsequently observed. Consider the following example: 


problem: diagnose disease A or disease B. 


The observed symptoms occur with probability 0.75 

for A and 0.1 for B. By the maximum-likelihood-of-detection 
Criteria (MLDC), diagnose disease A because A is the most 
likely cause of the observed symptoms (if there is no more 
information). But if we know that A is rare and B is common, 
the above decision may not be optimal and MLDC may not be 
appropriate. MLDC requires only that we know the event 
conditional probability density functions of the observations. 


That is: 


95 


p(z| 1) and p(z|m,) 


| dy eee p(z|m,) > p(z|t)) 
decision rule: d(z) = 


dy iG p(z|7,) < p(z|m) 


In the following development the Gaussian density 


is used to exemplify the model. 





ee eae 
p(z|a,) = 1//296n exo 2. 
0 0 mn 
Kom Za 5 
p(z|7,) = I1/v2nc0, exp{-1/2( Para 
il il O71 
p(z|m,) 
definition: lIikelihcod ratrem eae 
Bez To) 
for convention sake we assume zy > Za 
2 Z 
oie ano 
Mo 
Ho es My a 
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* note the class having the largest variance has a 


bifurcated decision region. 


In the case where the variances are equal, the 


Situation simplifies considerably. 


d 


ib 
Z — 2,2 -—2,. < 
20 (Zy -Z9) +o (Z -2Z)) 0 
oo 
d 
1 
2 — 2,—2 2 
20 (Z) - Zo) - oO (Z) - Zy) : Q 
= 
do 
> -_ 
22 : (Z, +Z9) 
a 
> (Z) +29) . 
Z — es 
= 2 


Om, 


Ho f HU, 

It is obvious that z* is simply the average of the 
means of the class-conditional distributions and is found 
at the intersections of the two density curves. 

In the foregoing, normal class conditional distrijm@e 
tions were assumed. This was done because the Gaussian fOim 
admits of a rather clean analytical solution. However, the 
general concept of the minimum probable error decision 
Criteria may be applied to any form of density functirene 
Indeed, the density function of one group need not even be 
the same form as that for another group (one might be exponen- 
tial and the other Gaussian). The difficulty with most nem. 
Gaussian forms is that they seldom admit of closed analytical 
forms and require numerical means in determination of 


thresholds. 
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APPENDIX B 


NOGAPS PREDICTOR PARAMETERS AVAILABLE FOR 
NORTH ATLANTIC OCEAN EXPERIMENTS 


Area: Entire North Atlantic Ocean and Mediterranean Sea 


Model output time: 1200GMT (TAU-00) 


Model output 
Panane ees 


D1O000 
BY 25 
D850 
BI 00  * 
D500 
D400 * 
pSO00 ~* 
Bzo0 * 
TAIR 
T1000 
2 
70.0) * 
i210'0 
T400 * 
i200  * 
mZ50 * 
EAIR 
E1000 
mg 25 
Ee 50 
E700 * 
E500 
UBLW 


vidya uly) 19S 3 


Descriptive name of parameter 


1000 mb geopotential height 


925 mb geopotential height 
850 mb geopotential height 
700 mb geopotential height 
500 mb geopotential height 
400 mb geopotential height 
300 mb geopotential height 
250 mb geopotential height 


Surface air temperature 


1000 mb temperature 


925 mb temperature 
700 mb temperature 
U0 ioe cipemacune 
400 mb temperature 
300 mb temperature 
220 mo temperature 


Surface vapor pressure 
1000 mb vapor pressure 
925 mb vapor pressure 
850 mb vapor pressure 
700 mb vapor pressure 
2900 mb vapor pressure 


Boundary layer zonal wind component 


og 


U1000 


1000 mb zonal wind component 


UI2> 925 mb zonal wind coOmpoenene 

U850 850 mb zonal wind component 

7 OR 700 mb zonal wind component 

U500 500 mb zonal wind component 

U400% = 400 mb zonal wind component 

veO0m 300 mb zonal wind component 

U2 Soe 250 mb zonal wind component 

VBLW Boundary layer meridional wind 
COMmpeenene 

V1000 1000 mb meridional wind component 

VI25 925 mb meridional wind component 

V850 850 mb meridional wind component 

V7 OO. 700 mb meridional wind component 

vVo00 500 mb meridional wind component 

V400 * 400 mb meridional wind component 

VvsO00 "> 300 mb meridional wind component 

Vi 250° 4 250 mb meridional wind component 

WORD 25, 2% 925 (Nba VOmrtcriy 

VORDOO +s 500 mb evorete cy 

PS Surface pressure 

SMF Surface moisture flux 

PBLD Planetary boundary-layer depth 

STRIEO Percent stratus frequency 

STRUT Stratus thickness 

Sir: Surface heat flux 

ENTRN Entrainment at top of marine 
boOUnda rye haves 

DRAG = * Drag coefficient (C1) 


Derived parameters 


DTDP Vertical gradient of temperature 
(1000-9237 mbps) 

DEDP Vertical gradient of vapor pressure 
(1000=2507 mos) 

DUDP Vertical gradient of zonal wind 


(1000-850 mbs) 


100 


Ee 


DVDP 


RH 
ay 
DDVDP 


DY RDP 


ESUM 


Be RD 


Area: 


Vememecreegrad tent Ofemeridional wind 
(1000-850 mbs) 


Surface relative humidity 
Vintucder cemperature 


Vertical gradient of geopotential 
height (1000-850 mbs) 


Vereneweecracdlent Of YOrcCicity 
ro00— 925 mas) 


Sum of vapor pressures 
(1000-850 mbs) 


Product of vapor pressures 
(1000-850 mbs) 


Entire North Atlantic Ocean and Mediterranean Sea 


Model forecast projection: 1200GMT (TAU-24) 


Medel outeuwe 
par anerer 


D1OO00 
B925 
D850 
By O00 * 
D500 
D400 * 
P3200 * 
B250 * 
TAIR 
T1000 
Eg 25 
m700 * 
2 0'0 
T400 * 
c300 * 
m5 0 * 
EAIR 


Peavy) July 1983 


Descriptive name of parameter 


oOo emoeceoporecntial height 


925 mb geopotential height 
800 mb geopotential height 
(MimmMomseouoceontila Ll height 
SU0 7 Mmbsgeepocential height 
400 mb geopotential height 
300 mb geopotential height 
2505mby geepotential height 


Surface air temperature 


LOCOS eeniperature 


925 mb temperature 
700 mb temperature 
500 mb temperature 
400 mb temperature 
300 mb temperature 
2 eiiomee moc ta buine 


Sie race, Vaperm pressure 


Ox: 


E1000 
EZ 
E850 
OOS 
E500 
UBLW 
ULOOO 
UI25 
APF) 
U500 
U400 
U0 0s 
UZ50" 2 
VBLW 


V1OO00 


1000 mb vapor pressure 

925 mb vapor pressure 

850 mb vapor pressure 

700 mb vapor pressure 

200 mb vapor pressure 

Boundary layer zonal wind component 
1000 mb zonal wind component 


925 mb zonal wind component 


700 
500 
400 
300 
250 


mb 
mb 
mb 
mb 
mb 


zonal 
zonal 
zonal 
zonal 


zonal 


wind 
wind 
wind 
wind 


wind 


component 
COMPBoMemts 
COMpCnen 
COMmpeenenis 


component 


Boundary layer meridional wind 
component 


1000 mb meridional wind component 


V9i2Z5 
V850 
V700 
V¥500 
V400 
V300 
VZ50 


* 


* 


x 


VOR925 
VOR500 
PS 

SMP 
PBLD 
STRTFO 
STRITH 
SHF 
ENTRN 


DRAG 
PREC. 


925 mb meridional wind component 
850 mb meridional wind component 
700 mb meridional wind component 
500 mb meridional wind component 
400 mb meridional wind component 
300 mb meridional wind component 
250 mb meridional wind component 
92> Mb VOpLae ma 

500 Mb vVorelore, 


SUriace pressure 

Surface moisture flux 
Planetary boundary-layer depth 
Percent Straeus sr eeuene, 
Stratus thickness 

Surface heat flux 


Entrainment at top of marine 
boundary-layer 


Drag ceecfiriervent (C,) 


Total amount (mm.) of model 
precipitation in the last six hemes 
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er. 


SHWRS 


JUS Sabyeds: 
Day 25 


Total amount (mm.) of model precipita- 
PIOMmeasseectatea With cumulus convection 
in the last six hours 


Boundary layer inversion instability 


925 mb Divergence 


Derived parameters 


DTDP 


DEDP 


DUDP 


DVDP 


RH 
ee 
DDVDP 


DVRTDP 


ESUM 


EP RD 


Vertical gradient of temperature 
Clo Gn— 9 25am s ) 


Vertical gradient of vapor pressure 
(1000-850 mbs) 


Vertical gradient of zonal wind 
(1000-850 mbs) 


Vertical gradient of meridional wind 
(1000-850 mbs) 


Surface relative humidity 
Virtual temperature 


Vertical gradient of geopotential height 
(1000-850 mbs) 


Verercaleqradtent Of vorticity 
(500-925 mbs) 


Sum of vapor pressures 
CROOC=S 50° mis) 


Product of vapor pressures 
(L000-850 mbs) 


Area: Entire North Atlantic Ocean and Mediterranean Sea 


Mode iOLncecacemmnojcctiom. | I200GMT (TAU-48) 


Model output 
parameter 
D1O000 

B25 

D850 

B00 * 

D500 

D400 * 


15 May--9 July 1983 


Descriptive name of parameter 


1000 mb geopotential height 
925 mb geopotential height 
850 mb geopotential height 
700 mb geopotential height 
5900 mb geopotential height 
SCs geopotential height 
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bse” * 
BZ our 


300 mb geopotential height 
250 mb geopotential height 


TAGE Surface air temperature 


T1LOOO 1000 mb temperature 

noe 925 mb temperature 

L7oOys 700 mb temperature 

PSCG 500 mb temperature 

40105 <4 400 mb temperature 

TS Cont 300 mb temperature 

EZ 07% 250 mb temperature 

EAIR Surface vapor pressure 

E1000 1000 mb vapor pressure 

Bei 925 mb vapor pressure 

E850 850 mb vapor pressure 

EO0 700 mb vapor pressure 

E500 500 mb vapor pressure 

UBLW Boundary layer zonal wind component 

ULOOO 1000 mb zonal wind component 

W225 925 mb zonal wind component 

U850 850 mb zonal wind component 

U0. = 700 mb zonal wind component 

U500 500 mb zonal wind component 

U400 * 400 mb zonal wind component 

OLS 1018 Fs 300 mb zonal wind component 

UZ50 ~* 250 mb zonal wind component 

VBLW Boundary layer meridional wind 
component 

V1000 1000 mb meridional wind component 

V925 925 mb meridional wind component 

V850 850 mb meridional wind component 

V700"* 700 mb meridional wind component 

V500 500 mb meridional wind component 

V400 400 mb meridional wind component 

V300 300 mb meridional wind component 

Vz250 250 mb meridional wind component 
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-_ i = 


WORa9 Zo 
VOR500 
eS) 

SMF 
Ee) 
SlRTEO 
pen Lio 
Sig 
ENTRN 


DRAG 
PRECIP 


SHWRS 


INSTAB 
Bayo 25 


O23 bE venilci ty 

500 mem verneicity 

SUB aAceE Pressure 

Surface moisture Flux 
Planetary boundary-layer depth 
Percenme stratus frequency 
Stratus thickness 

Surface heat flux 


Entrainment at top of marine 
pOUMel a y— lay cr 


DrageececrtnVve1 ent (Cp) 


Total amount (mm.) of model precipitation 
in the last six hours 


Total amount (mm.) of model precipitation 
associated with cumulus convection in 
the last six hours 


Boundary layer inversion instability 


Pilon eng enee 


Derived parameters 


DTDP 


DEDP 


DUDP 


DVDP 


RH 
TV 
DDVDP 


DV RTDP 


ESUM 


ie RD 


Vertical gradient of temperature 
(1000-925 mbs) 


Vertical. gradient of vapor pressure 
(1000-850 mbs) 


Vertical gradient of zonal wind 
COO e507 ms) 


Vertical gradient of meridional wind 
(1000-850 mbs) 


Surface relative humidity 
Virtual temperature 


Vemure wi mCmagdionraor Geoporential height 
(1000-850 mbs) 


VemmEcoalmoLaAdtent Or VORELCIEY 
(500-925 mbs) 


SUlmOn evapo pressures 
Coo 8 Oe mbps) 


Product of vapor pressures 
(1000-850 mbs) 


KOS 


k* 


Parameters which were not used due to their 
being considered as physically unimportant 
in forecasting marine Vistpwere] 


Parameters which were not used due to loss of 


significant digits during transfer from tape 
tO MaSS Seorage, 
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APPENDIX C 


Siew tinnAn SeORES, DEFINITIONS [Karl, 1984] 





—_ 

Ww) 

< 

O 

Ww) 

Cc 

O 

LL 

OBSERVED 

Mecal =  R +S + TF +U+ V+ WwW + X + ¥Y + Z 
Pl = £4(R+U+X) /Total P3 = (T+W+Z)/Total 
pee = (StVtTY) Total PN = greatest of Pl, P2 or P3 
Raw Scores 
AQ = $% correct = (X+V+T)/Total 
Al = one-class error = £(U+S+Y+W)/Total 
mol = Ihreat score for visibility category I 


= X/(R+U+X+Y+Z) 


isa = Threat score for visibility category II 


= V/(S+V+Y+U+W) 


Mets —  Whreat score for visibility categories I and II 


(X+V) /(Total-T) 
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TS12 is designed to represent the skill Gf fereeas eum 
visibility categories I and II as separate categories, 
rather than their skill as a combined category, which 
would be (U+V+X+Y)/(Total-T). 


Adjusted scores 


AAO = (AO-PN)/(1-PN) 

ATS1 = (TS1-P1)/(1-P1) 

ATS2 = (TS2-P2)/(1-P2) 

ATS12 = (TS12-(P1+P2))/(1-(P1+P2) ) 
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tet. 


ieET . 


APE ENDEA D 


BMDP LINEAR REGRESSION EQUATION PREDICTOR SETS, 
NORTH ATLANTIC OCEAN EXPERIMENTS (PR+BMD MODEL) 


Area 2, TAU-00 (BMDP P2R) 


BMD1 


BMD2 


BMD3 


Area 


BMD1 


BMD2 


BMD3 


Area 2, 


BMD1 


Z 


f 


ZO eee 7 aoa) + 901.837 062E—05*D500 
POO Sot 2057 “DPDP +1 0)05872*ESUM ** 


—~LO.4469 ~~ 0. lies4*BAIR + 0.10124*SMP 
=r 409 FEo 2 5 en 6401 *E925 


Sa oe ee EM eed O6116*DEDP 
tT Orwiels 2 +h PRE 


TAU-24 (BMDP P9R) 


=Z0mor eo — O20 70S*E850 = 0.078694*T925 
TURD 3 36/4 SoHE — O00316725* LNSTAB 
PEO 7597 ly + U0 de7 965 *houM 


Zo kOe Oe Oooo L0>*TM + 0,53048E-04*V500 
Oras 5 +2207 64 7D DP 
POZO. DDVMP 4 0 .00563327*EPRD ** 


ee eee oe Ure Sole Fm APS. + 002735 75*TS500 
PeU 0049555 ~PELD —~10,00625203*STRIFO 
=U so Ooo col Ril 0200272694 -DTDP 


TAU-48 (BMDP P9R) 


oreo = Ornb4 0 S4*E 500 —207.70897567*T925 


ec Voom tO. O22 0 Snr te. 008605/4*RHA 
tT Uw oo oy hy, 
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BMD2 = 1.85487 + 0.0777253*TM - 0.0266753*E850 

-~ 0.0000390116*U500 - 0.0000366663*V500 

+ 0.0240246*DDVDP + 0.105648*ESUM ** 
BMD3 = -13.9637 + 0.0160572*PS + 0.00308705*PBLD 


- 0.0031323*STRIFO — 0.00846443-DiurE 
+ 25.6871*DEDP — 0. 002963427 bei 


IV. Area 3W, TAU-24 (BMDP P2R) 


BMD1L = 2.673 —- 0.09363*ES50 —"0) @3 ie. 
0.20451*E925 + 0.0305*SHF + 0 .1Sslit tae 


BMD2 = 1.15536 + 0.16326“*EAIR + GeO 4a 
+ 0.13014E-047DM +38.03795 fee 
+ 0.02091*DDVDP -— "0.0072 ceErRE 


BMD3 = -18.55031 + 0.02089*PS - OSs0G4¢en 065 2m 
= 0.01LI51*STRIFO + 010277246. 
=. 03,0572 62 D0UDE 


V. Area 3W, TAU-48 (BMDP P9R) 


BDMlL = 1.92874 —- 0.0719817*7925 = 03201062 
+ 0.0376905*SHF + 7.66796*DEDP + 0.182703 22 
= 000525094 7b eo Rp 

BMD2 = -33.2574 - 0.1459*E850 — 0.,000205443. 7-7 
+ 0.325802*SHWRS + 0.0168064*RH + 0.124434. 
+0024 7277 40D) DP ea 

BMD3 = -10.1316 + 0.0126085*PS —S05000 3240537 


+ 0.000112099*U5S00 - Of00e COG c {Soria 
+ 0.0159356*STRITH — 0.001745) 


iG 


VI. Area 4, TAU-00 (BMDP P2R) 


BMD — 2.060704 Ue 4orh—04*U500 
sem o4oe2h wa 200 — 0 .30475h=02*5TRIFO 
Bee 27 OL >on O ao aon -04*DM — 0.00904*STRTTH 
TH USORocoscHe et e4.09377*DIDP ** 
BPome=  2,o¢00) + 0. 24549E-03*VBLW — 0.10113*E850 


tre Sooo — > sold + 0032/3 *ESUM 


VII. Area 4, TAU-24 (BMDP P9R) 


SOO OO 75507 “ae 0. 0000464491 *V500 
SOO 205 -Lo2 53 =0557/6267*VRTI25 
Orava toa g el DP + 9. 51302*DEDP 


il 
Lo 


BMD1 


Mpa O08 o5027H 500 +°0.00372204*PBLD 
—Om042 2342 SURTPOs+ 0 .0154289-SHFr 
=ewoo Loa oe twee Ol43745“*DVDP — ** 


II 
Lo 


BMD2 


ao o or O27 5966 4P5)— 90 .0549077*E850 
Porm oo // Po O0et O.0140852*4INSTAB 
TOPs Ciow io ves OO 2264 ~DDVDP 


BMD3 


VIII. Area 4, TAU-48 (BMDP P9R) 


Ae on eae sO Oro 262° 650 FOO. 0L616*T500 
Urine oe Abo eet 0. 207/098 SHWRS 
eo oOo Os LS SS“TY | *4 


Pee bolo e +) 0.0067 8538*PSs = 0.0850887*E500 
SOOO S022 sWo0' 0 — 00000596 501*%V500 
Ome 7 dole olm THO o— O07 00917 554*RH 


Mea 


BMD3 


*x* 


2.08319 + 0.0771067*IM -"°O%0Zc7Z7 ee 
- 1.5829*VRT925 —= 0.001147 ie 
+ 13.2073*DEDP + 0.028932 bs BE 


Equation selected as predictors in the PR+BMD model 


dal 


APPENDIX £& 
BMDP LINEAR REGRESSION EQUATION PREDICTOR SETS 
FOR TWO-STAGE THRESHOLD MODELS 
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VI. Area 4, TAU-00 Threshold Equations (BMDP P2R) 
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Figure 1. Proposed U.S. Navy Model Output Statistics 
(MOS) development schedule 
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DEPENDENT DATA 


10 =78.18%as0-0.52% 
172 1278 
A1=15.28% 
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a1-13.42% 
ts1= 0.34 atsi- 0.29 





FORECAST 
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TS2= 0.114 AtTS2--0.01 





T12—0.22 41S12> 0.04 


Figure 17. Contingency table results for the area 2, 
TAU-24 equal variance threshold model 
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TS27= 9,14 ATS2=0.03 
TS12=0.29 Ats12= 0.0 


1 2 3 
OBSERVED 


INDEPENDENT -DATSA 


406 8.60% 440= -63.31% 


A1=27,53% 
T81=0.34 atsi-0.29 
™S2=0.11 ats2- 0.0 


FORECAST 


TS12= 0,19 atsi2: 0.0 
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Figure 18. Contingency table results for the aneame 
TAU-24 quadratic threshold model 
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DEPENDENT DATA 


so-77.59%aa0o- -0.73% 


41=15.87% 
181-0.29 atsi- 0.21 


FORECAST 


152=0.07 ats2=-0.07 
19122 0.17 atsi2= -0.07 
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INDEPENDENT DATA 


00=80.11%AA0= 7.07% 


a17=13.62% 
™TS$170.36 atsi= 0.29 


FORECAST 


™62=0.11 2ats2--0.01 





TS127 9.23 atsi2= 0.02 


2 
OBSERVED 


Figure 25. Contingency table results for the area 2, 
Pie emecuale var lance thnreshetd model 
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DEPENDENT DATA 


A0=77.43% 4A0--1.46% 
ai1= 15.39% 
5120.29 artsi=- 0.21 
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TS2=0.07 ats2=:-0.06 





TS12= 0.17 ATS12=-0 07 
1 2 
OBSERVED 


INDEPENDENT DATA 


Figure 26. 


10-7 9.68% aro- 5.05% 


Al? 13.62% 
™$1-=0.36 = atsi: 0.29 


FORECAST 


™TS2= 0.09 ats2=-0.03 
T$12=0.22 atTS12=0.01 





OBSERVED 


Contingency table results for the area 2, 
TAU-48 quadratic threshold model 
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DEPENDENT DATA 


A0=73.00%AA0-=13.77% 
at= 17.17% 
TS1=0.38 3 ats1= 0.23 


FORECAST 


1s2= 0.12 ars2--0.01 
TS812= 0.29 atsi2= -0.04 





OBSERVED 


INDEPENDENT DATA 


a0 1.00% aao= 9.29% 
a1718.67% 

TS12? 0.33 ats1: 0.17 
TS2= 0.11 ats2--0.01 


FORECAST 


1s12=0.26 atsi2= -0.09 





OBSERVED 


BEegure 35. Contingency table results for the area 3W, 
TAU-24 equal variance threshold model 


ys) 


DEPENDENT DATA 

10=73.22% 140 -14.45% 
ai-17.60% 

1S1= 0.37 | ars1-0 22 


FORECAST 


TS2= 0.11 ats2- —-0.01 
T$12= 0.28 atsi2=-0.04 





INDEPENDENT DATA 
aor? 1.43% an0-10.62% 
ai-18.95% 


TS$1= 0.32 atsi1-0.16 
T8220.11 ats2--0.02 


FORECAST 


T8122=0.25 atsi2=-70.10 





OBSERVED 


Figure 34. Contingency table results for the areame 
TAU-24 quadratic threshold model 
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DEPENDENT DATA 


A0=7 1.49% as0-10.92% 
A1=17.42% 
T81-=0.382 atsi=- 0.15 


FORECAST 


ts2=0.19 ats2--0.03 
1T812=0.25 atsi2=-0.10 





GESERVY ED 


INDEPENDENT DATA 


ps4 | at fio 
1 2 3 


OBSERVED 


ao*=7 1.60% 4a0:13.17% 





A1=18.17% 
t51-=0.31 3 ats1-:0.17 





FORECAST 





TS2=0.11  AtTS2=-0.04 





TS$12=2 0.25 ATS12=—-Q, 1 1 


Figure 41. Contingency table results for the area 3W, 
TAU-48 equal variance threshold model 


oe 


DEPENDENT DATA 


© 00=:71.22% aao- 10.08% 
ai=17.82% 
ts1- 0.31 ats:-0.14 


FORECAST 


TS82=0.10 ats2--0.03 
TS12=Q0.25 arsi2=-0.11 





2 
OBSERVED 


INDEPENDENT DATA 
AO«7 1.47% aao-12.76% 
a1- 18.71% 


™$170.32 atsi=0.17 
ts2= 0.11 ats2--0.04 


FORECAST 


151220.25 atsi2=-0.11 





OSS EH VED 


Figure 42. Contingency table results for the reams 
TAU-48 quadratic threshold model 
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DEPENDENT DATA 


00 -82.85% aso=-12.25% 
ai-13.85% 
1s1=0.08 ATS1=90.05 


FORECAST 


1S2=0.08 ats2=-0.05 
ts12=0.08 artsi2=-0.09 





2 
OBSERVED 


INDEPENDENT DATA 


ps | m | os 
Z | 


OBSERVEO 


£0283.59% Aao--13.15% 
a1=13.27% 






151-0.07 ATS1=: 0.05 
ts2=0.08 arts2--0.05 


FORECAST 


ts12=0.08 ,41s12=-0.08 


Bore 55. ~“Comtingency table results for the area 4, 
TAU-24 equal variance threshold model 
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Figure 62. Contingency table results for tiesageoams 
TAU-48 equal variance threshold model 
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Figure 68. Contingency table results for the minimum 
probable error threshold model (EVAR) for 
area 4, TAU-24. The contingency tables reflect 
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versus VISCAT II separations 
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